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PREFACE 


Statistical Mechanics is the extended version of my earlier text, Statistical Thermo- 
dynamics, whose Preface is reprinted following these pages. The present volume is 
intended primarily for a two-semester course or for a second one-semester course in 
statistical mechanics. Whereas Statistical Thermodynamics deals principally with equi- 
librium systems whose particles are either independent or effectively independent. 
Statistical Mechanics treats equilibrium systems whose particles are strongly inter- 
acting as well as nonequilibrium systems. The first eleven chapters of Statistical 
Thermodynamics form also the first eleven chapters of Statistical Mechanics . Chapter 
15, Imperfect Gases, becomes Chapter 12 of the present volume, and the next ten 
chapters, 13-22, are completely new. Chapter 13 deals with the radial distri- 
bution function approach to liquids, and Chapter 14 is a fairly detailed discussion of 
statistical mechanical perturbation theories of liquids. These theories were developed 
in the late 1960s and early 1970s and have brought the numerical calculation of the 
thermodynamic properties of simple dense fluids to a practical level. A number of the 
problems at the end of Chapter 14 require the student to calculate such properties 
and compare the results to experimental data. Chapter 15, on ionic solutions, is the 
last chapter on equilibrium systems. Section 15-2 is an introduction to the recent 
(1970s) advances in ionic solution theory that now allow one to calculate the thermo- 
dynamic properties of simple ionic solutions up to concentrations of 2 molar. 

Chapters 16-22 treat systems that are not in equilibrium. Chapters 16 and 17 are 
meant to be somewhat of a review, although admittedly much of the material, par- 
ticularly in Chapter 17, will be new. Nevertheless, these two chapters do serve as a 
background for the rest. Chapter 18 presents the rigorous kinetic theory of gases as 
formulated through the Boltzmann equation, the famous integro-differential equation 
whose solution gives the nonequilibrium distribution of a molecule in velocity space. 
The long-time or equilibrium solution of the Boltzmann equation is the well-known 
Maxwell-Boltzmann distribution (Chapter 7). Being an integro-differential equation, 
it is not surprising that its solution is fairly involved. We only outline the standard 
method of solution, called the Chapman-Enskog method, in Section 19-1, and the 
next two sections are a practical calculation of the transport properties of gases. In the 
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last section of Chapter 19 we discuss Enskog’s ad hoc extension of the Boltzmann 
equation to dense hard-sphere fluids. Chapter 20, which presents the Langevin equa- 
tion and the Fokker-Planck equation, again is somewhat of a digression but does 
serve as a background to Chapters 21 and 22. 

The 1950s saw the beginning of the development of a new approach to transport 
processes that has grown into one of the most active and fruitful areas of nonequi- 
librium statistical mechanics. This work was initiated by Green and Kubo, who showed 
that the phenomenological coefficients describing many transport processes and time- 
dependent phenomena in general could be written as integrals over a certain type of 
function called a time-correlation function. The time-correlation function associated 
with some particular process is in a sense the analog of the partition function for 
equilibrium systems. Although both are difficult to evaluate exactly, the appropriate 
properties of the system of interest can be formally expressed in terms of these func- 
tions, and they serve as basic starting points for computationally convenient approxi- 
mations. Before the development of the time-correlation function formalism, there 
was no single unifying approach to nonequilibrium statistical mechanics such as Gibbs 
had given to equilibrium statistical mechanics. 

Chapters 21 and 22, two long chapters, introduce the time-correlation function 
approach. We have chosen to introduce the time-correlation function formalism 
through the absorption of electromagnetic radiation by a system of molecules since 
the application is of general interest and the derivation of the key formulas is quite 
pedagogical and requires no special techniques. After presenting a similar application 
to light scattering, we then develop the formalism in a more general way and apply 
the general formalism to dielectric relaxation, thermal transport, neutron scattering, 
light scattering, and several others. 

There are also eleven Appendixes. The first three are from Statistical Thermo- 
dynamics, but the rest are new and are meant to supplement the textual material. Of 
particular interest is Appendix D, which lists a FORTRAN program for the hard- 
sphere radial distribution. This program allows one to apply the equations of Chapter 
14 (perturbation theories of liquids) with ease and is central to a new, practical approach 
to the theory of liquids. 

As in Statistical Thermodynamics , the intention here is to present a readable intro- 
duction to the topics covered rather than a rigorous, formal development. In addition, 
a great number of problems is included at the end of each chapter in order either to 
increase the student’s understanding of the material or to introduce him or her to 
selected extensions. Generally, this volume is an extension of Statistical Thermo- 
dynamics to more complex and more interesting systems, but the spirit and level of 
the two are comparable. 


Donald A. McQuarrie 



Preface to 

Statistical Thermodynamics 


Statistical Thermodynamics represents the expansion of notes for a one-semester, 
first-year graduate course in statistical mechanics. The present volume comprises the 
first 15 chapters of a larger work entitled Statistical Mechanics that is to be published 
in 1976. There is intentionally more material in the present volume than should be 
necessary for a single semester; additional topics have been included in an attempt 
to recognize each instructor’s personal prejudices. It is assumed that students have had 
a standard course in physical chemistry but have not previously had a serious course 
in quantum mechanics. The mathematical prerequisite of the book is, at most, a 
knowledge of linear differential equations with constant coefficients. All of the neces- 
sary mathematical techniques have been developed within the book itself, either in the 
main text, in problems, or in Appendix B. 

There is no doubt that the term “statistical thermodynamics” or “statistical 
mechanics ” either frightens or brings forth scorn from many people. To eradicate 
this attitude, a serious attempt has been made to write this book with the experi- 
mentalist in mind, whether he be instructor or student. There are extensive appli- 
cations of the formal equations to experimental data both in the text itself and in the 
numerous applied problems. For example, a large number of figures and tables in the 
book compare theoretical predictions to experimental results for a variety of real 
systems. This has, I hope, not been done at the expense of a sensible degree of rigor or 
logical clarity; I have tried to achieve a balance in this respect. What I ask of the 
student is to make a serious effort to conceptualize and abstract for the first few 
chapters, because from then on he will find that practical results and rewards come 
quite rapidly and easily. 

The first chapter is called a review, but no doubt much of the material will be 
new to many students. The topics included here are particularly those points of 
classical mechanics, quantum mechanics, and mathematics that play a central role 
in the development of statistical mechanics. Chapter 2 introduces the fundamental 
postulates and concepts of statistical mechanics with particular reference to the 
so-called canonical ensemble, in other words, to thermodynamic (macroscopic) sys- 
tems with fixed volume, temperature, and number of particles. This chapter is fairly 
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formal, but it concludes with problems that anticipate the utility of the formal 
equations derived in the chapter. 

The approach used to develop satistical mechanics can be credited (at least in 
the textbook literature) to T. L. Hill’s Introduction to Statistical Thermodynamics 
(1960). There are basically two reasons for this choice: It is the most appealing ap- 
proach from both a logical and pedagogical point of view, but probably more to the 
point, I was a student of Hill shortly after the publication of his book and grew up 
statistical-mechanically with his approach. 

Chapter 3 extends the formal development of Chapter 2, and Chapter 4 illustrates 
how the equations of Chapters 2 and 3 may be reduced to very practical formulas 
that are applicable to an amazing variety of systems. These applications, in fact, 
largely constitute the balance of this volume. The common feature of the systems dis- 
cussed in Chapters 5 through 14 is this: Atoms or molecules making up these systems 
are either independent or “ effectively ” independent. This results in equations that are 
not only easy to use but lead to comparisons with experiment that are usually quite 
decent and rewarding. 

The final chapter investigates imperfect gases, the simplest system in which atoms 
or molecules cannot be treated as independent or even effectively independent. Here 
the intermolecular potential begins to emerge as a major complicating factor. A prin- 
cipal goal of this chapter is the introduction of one of the more advanced techniques 
that can be used to treat complicated systems. Chapters 13 through 15 of the larger 
volume Statistical Mechanics (which might be covered in the first weeks of a second 
semester) treat other systems in which the intermolecular potential plays a central 
role. The inclusion of Chapter 15 on imperfect gases in the present volume is to assure 
the skeptical student that statistical mechanics is quite capable of treating general 
nonindependent systems — and possibly to entice him into a more challenging and much 
more exciting second semester when he can, in fifteen weeks, be brought close to a 
“ literature level ” in statistical mechanics. 

Although this book has evolved from teaching a course in statistical mechanics 
for several years, much effort has been made to produce a book that is equally suitable 
for self-study. (I have also kept in mind the apprehensive instructor who is not a pro- 
fessional theoretical chemist.) To this purpose, there is a fairly extensive and specific 
bibliography available at the end of each chapter to direct either student or instructor 
to other treatments of the material covered. Obviously no book of this type is written 
in a vacuum, and the bibliography represents those sources that were particularly 
useful to me. 

One of the most important and useful features of this book is the extensive collec- 
tion of problems presented after each chapter. The only way to learn statistical 
mechanics, or any other physical chemical subject, is to work a great many problems. 
Consequently, there are over 450 problems in the book, ranging in difficulty from 
“ plug in the number and get an answer” to those that require little writing but much 
ingenuity. A few of the problems fill in steps leading from one equation to another, 
while others extend material to research papers in the literature. Most of the problems, 
however, are intended as applications of the equations and concepts presented in 
chapters. It is primarily the problems that dictate the level of a course such as statistical 
mechanics, and the variety and number included here allow an instructor to present a 
straightforward application-oriented course that should be perfectly reasonable for 
seniors or a fairly demanding introductory course for theoretical graduate students. 


D. A. M. 
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CHAPTER 1 


INTRODUCTION 
AND REVIEW 


1-1 INTRODUCTION 

Statistical mechanics is that branch of physics which studies macroscopic systems 
from a microscopic or molecular point of view. The goal of statistical mechanics is 
the understanding and prediction of macroscopic phenomena and the calculation of 
macroscopic properties from the properties of the individual molecules making up the 
system. 

Present-day research in statistical mechanics varies from mathematically sophis- 
ticated discussions of general theorems to almost empirical calculations based upon 
simple, but nevertheless useful, molecular models. An example of the first type of 
research is the investigation of the question of whether statistical mechanics, as it is 
formulated today, is even capable of predicting the existence of a first-order phase 
transition. General questions of this kind are by their nature mathematically involved 
and are generally beyond the level of this book. We shall, however, discuss such 
questions to some extent later on. On the other hand, for many scientists statistical 
mechanics merely provides a recipe or prescription which allows them to calculate the 
properties of the physical systems which they are studying. 

The techniques of statistical mechanics have been used in attacking a wide variety 
of physical problems. A quick glance through this text will show that statistical 
mechanics has been applied to gases, liquids, solutions, electrolytic solutions, polymers, 
adsorption, metals, spectroscopy, transport theory, the helix-coil transition of DNA, 
the electrical properties of matter, and cell membranes, among others. 

Statistical mechanics may be broadly classified into two parts, one dealing with 
systems in equilibrium and the other with systems not in equilibrium. The treatment 
of systems in equilibrium is usually referred to as statistical thermodynamics , since it 
forms a bridge between thermodynamics (often called classical thermodynamics) and 
molecular physics. 

Thermodynamics provides us with mathematical relations between the various 
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experimental properties of macroscopic systems in equilibrium. An example of such 
a thermodynamic relation is that between the molar heat capacities at constant 
pressure and at constant volume. 



Another, and one that we shall use in the next chapter, is 



Note that thermodynamics provides connections between many properties, but does 
not supply information concerning the magnitude of any one. Neither does it attempt 
to base any relation on molecular models or interpretations. This, in fact, is both the 
power and weakness of thermodynamics. It is a general discipline which does not 
need to recognize or rely upon the existence of atoms and molecules. Its many relations 
would remain valid even if matter were continuous. In addition, there are many systems 
(such as biological systems) which are too complicated to be described by an acceptable 
molecular theory, but here again the relations given by thermodynamics are exact. 
This great generality, however, is paid for by its inability to calculate physical proper- 
ties separately or to supply physical interpretations of its equations. When one seeks 
a molecular theory which can do just this, one then enters the field of statistical 
thermodynamics. Thus thermodynamics and statistical thermodynamics treat the same 
systems. Thermodynamics provides general relations without the need of ever consider- 
ing the ultimate constitution of matter, while statistical thermodynamics, on the other 
hand, assumes the existence of atoms and molecules to calculate and interpret thermo- 
dynamic quantities from a molecular point of view. 

Statistical thermodynamics itself may be further divided into two areas: first, the 
study of systems of molecules in which molecular interactions may be neglected (such 
as dilute gases), and second, the study of systems in which the molecular interactions 
are of prime importance (such as liquids). We shall see that the neglect of intermolec- 
ular interactions enormously simplifies our problem. Chapters 4 through 11 of the 
book are devoted to the treatment of systems in which these interactions either may 
be ignored or highly simplified. This is the kind of statistical thermodynamics to which 
most undergraduates have been exposed and, to some extent, represents typical 
statistical thermodynamical research done in the 1930s. The more interesting and 
challenging problems, however, concern systems in which these molecular interactions 
cannot be neglected; Chapters 12 through 15 of the present volume are devoted to the 
study of such systems. It is in this latter area that a great deal of the research of the 
1940s, 1950s, and 1960s was carried out. There are, of course, many important 
problems of this sort still awaiting attack. The theory of concentrated electrolyte 
solutions and the proof for the existence of first-order phase transitions are just two 
examples. 

The most difficult branch of statistical mechanics, both mathematically and con- 
ceptually, is the study of systems not in equilibrium. This field is often referred to as 
nonequilibrium statistical mechanics . This is presently a very active area of research. 

There are still some important unsolved conceptual problems in nonequilibrium 
statistical mechanics. Nevertheless, in the 1950s great strides were made toward the 
establishment of a firm basis for nonequilibrium statistical mechanics, commensurate 
with that of equilibrium statistical mechanics, or what we have called statistical 
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thermodynamics. Chapters 16 through 22 of Statistical Mechanics present an intro- 
duction to some of the more elementary of these fairly new and useful concepts and 
techniques. 

In Chapter 2 we shall introduce and discuss the basic concepts and assumptions of 
statistical thermodynamics. We shall present these ideas in terms of quantum mechan- 
ical properties such as energy states, wave functions, and degeneracy. Although it 
may appear at this point that quantum mechanics is a prerequisite for statistical 
thermodynamics, it will turn out that a satisfactory version of statistical thermo- 
dynamics can be presented by using only a few quantum mechanical ideas and results. 
We assume that the student is familiar with only the amount of quantum mechanics 
taught in most present-day physical chemistry courses. About the only requirement of 
the first few chapters is an understanding that the Schrodinger equation determines 
the possible energy values Ej available to the system and that these may have a 
degeneracy associated with them which we denote by ft (£,-). 

Before discussing the principles, however, we shall present in this chapter a discus- 
sion of some of the terms or concepts that are particularly useful in statistical thermo- 
dynamics. In Section 1-2 we shall treat classical mechanics, including an introduction 
to the Lagrangian and Hamiltonian formalisms. In Section 1-3 we shall briefly review 
the main features of quantum mechanics and give the solutions of the Schrodinger 
equation for some important systems. The only new material in this section to most 
students probably will be the discussion of the eigenvalues or energy levels of a many- 
body system. Then in Section 1-4 we shall review thermodynamics briefly, since it is 
assumed that the reader is familiar with the three laws of thermodynamics and the 
tedious manipulations of partial derivatives. Two important topics that are not 
usually discussed in elementary physical chemistry texts are introduced, however. 
These two topics are the Legendre transformation and Euler’s theorem, both of which 
are useful in studying statistical thermodynamics. Finally, in Section 1-5 we shall 
discuss some mathematical techniques and results that are particularly useful in statis- 
tical thermodynamics. Much of this section may be new material to the reader. 

1-2 CLASSICAL MECHANICS 
NEWTONIAN APPROACH 

Everyone knows the equation F = ma. What this equation really says is that the 
rate of change of momentum is equal to the applied force. If we denote the momentum 
by p, we have then a more general version of Newton’s second law, namely, 

do 

S“*-F ‘’- 31 

If the mass is independent of time, then dp/dt = mdrldt = mf = mo. If F is given as a 
function of position F(x, y, z), then Eq. (1-3) represents a set of second-order differen- 
tial equations in x , y 9 and z whose solutions give x, j>, and z as a function of time if 
some initial conditions are known. Thus Eq. (1-3) is called an equation of motion. We 
shall consider three applications of this equation. 

Example 1. Solve the equation of motion of a body of mass m shot vertically 
upward with an initial velocity v 0 in a gravitational field. 

If we choose the x-axis (positive in the upward direction) to be the height of the 
body, then we have 

mx = — mg 
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where mg is the magnitude of the force. The negative sign indicates that the force is 
acting in a downward direction. The solution to this differential equation for x is then 

x(t) = -igt 2 + v 0 1 + x 0 (1-4) 


with x 0 equal to x(0), which in our case is 0. This then gives the position of the body 
at any time after it was projected. The extension of this problem to two dimensions 
(i.e., a shell shot out of a cannon) and the inclusion of viscous drag on the body are 
discussed in Problems 1-1 and 1-2. 

Example 2. Set up and solve the equation of motion of a simple harmonic oscillator. 

Let x 0 be the length of the unstrained spring. Hooke’s law says that the force on 
the mass attached to the end of the spring is F = —k(x — x 0 ). If we let £ = x — x 0 , we 
can write 


s + -«=° 

at 1 m 


(1-5) 


whose solution is 

f (0 = A sin cot + B cos cot (1-6) 

The quantity 

co = (kjm) 1/2 (1-7) 


is the natural vibrational frequency of the system. Equation (1-6) can be written in an 
alternative form (see Problem 1-5) 

{(f) = C sin (cot + <f>) (1-8) 


This shows more clearly that the mass undergoes simple harmonic motion with 
frequency co. Problems 1-3 through 1-5 illustrate some of the basic features of simple 
harmonic motion. 

Example 3. Two-dimensional motion of a body under coulombic attraction to a 
fixed center. 

In this case the force is F = — Ar/r 3 , that is, it is of magnitude — K r 2 and directed 
radially. Newton’s equations become 


mx = F x = 


Kx 

(x 2 + y 2 ) 3/2 


™y = Fy=~ (x 2 + -y, 2 O- 9 ) 

Unlike our previous examples, these two equations are difficult to solve. Since the 
force depends, in a natural way, on the polar coordinates r and 0, it is more convenient 
for us to set the problem up in a polar coordinate system. Using then 

x = r cos 6 
y = r sin 6 


and some straightforward differentiation, we get 

| m{r — 6 2 r) + -^jcos 6 — m(r8 + 20r)sin 0 = 0 (1—1 0a) 

|/n(r — 6 2 r) + -jjsin 0 + m(r6 + 20r)cos 0 = 0 


(1-1 0b) 
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By multiplying the first of these equations by cos 0 and the second by sin 0 and then 
adding the two, one gets 

m(f — 6 2 r) + — = 0 (1-11) 

But this is just the term in braces in Eq. (1-10), which leads us to the result that 
m(r0 + 20r) = O (1-12) 

as well. Equation (1-12) can be written in the form 


i 1(^.0 


(1-13) 


which implies that 
mr 2 6 = constant 


(1-14) 


This quantity, mr 2 6 , which maintains a fixed value during the motion of the particle, 
is called the angular momentum of the particle and is denoted by /. The angular 
momentum is always conserved if the force is central, that is, directed along r (see 
Problem 1-10). 

Equation (1-14) can be used to eliminate 0 from Eq. (1-1 1) to give an equation in r 
alone, called the radial equation: 


mf — 


l 2 K 
mr 3 + r 2 


= 0 


(1-15) 


This equation can be solved (at least numerically) to give r(t), which together with 
Eq. (1-14) gives 6(t). 

Even though the solution in this example is somewhat involved using polar coordi- 
nates, it is nevertheless much easier than if we had used Cartesian coordinates. This 
is just one example of many possibilities, which show that it is advantageous to 
recognize the symmetry of the problem by using the appropriate coordinate system. 

This example was introduced, however, to illustrate another important point. 
Notice that Eq. (1-15) for r can be written as a Newtonian equation (i.e., in the form 
F=ma) 

K l 2 
mf = — r + — = 
r z mr 

if we interpret the term l 2 \mr 3 as a force. This force is the well-known centrifugal 
force and must be introduced into the equation for mf. 

This constitutes the main disadvantage of the Newtonian approach. The form of 
the equation mij = F n (where rj is some general coordinate) is useful only in Cartesian 
systems, unless we are prepared to define additional forces, such as the centrifugal 
force in the above example. At times these necessary additional forces are fairly 
obscure. 

There exist more convenient formulations of classical mechanics which are not tied 
to any one coordinate system. The two formulations that we are about to introduce 
are, in fact, independent of the coordinate system employed. These are the Lagrangian 
and the Hamiltonian formulations. 
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LAGRANGIAN APPROACH 

Let K be the kinetic energy of a particle. In Cartesian coordinates 
K(x, y,z) = j (x 2 + y 2 + z 2 ) 

Let the potential energy be U. In many problems U is a function of position only, and 
so we write U(x 9 y, z ). Newton’s equations are 

dU 

mx = — — 
dx 

with similar equations for y and z. Now introduce a new function 

L(x 9 y, z, x, y, z) = K(x 9 y, z) - U(x 9 y, z) 

This function is called the Lagrangian of the system. In terms of L 9 we have 

dL dK 
T - = — = mx 
dx ox 

dx dx 


and we can write Newton’s equations in the form 
d_ (dL\ _ dL 
dt \dx/ dx 


d-16) 


with similar equations for y and z. These are Lagrange’s equations of motion in 
Cartesian coordinates. The remarkable and useful property of Lagrange’s equations 
is that they have the same form in any coordinate system. If the x 9 y, z are transformed 
into any other system, say q l9 q 2 , , Lagrange’s equations take the form 


dt [dqj dqj 


j =1,2, 3 


(1-17) 


This can be proved by writing x = x(q l9 q 2 , q 3 ) 9 y = y(q l9 q 2 , q 3 ) 9 and z = z(q l9 q 2 , q 3 ) 
and then transforming Eq. (1-16) into Eq. (1-17). (See Problem 1-13.) 

Lagrange’s equations are more useful than Newton’s equations in many problems 
because it is usually much easier to write down an expression for the potential energy 
in some appropriate coordinate system than it is to recognize all the various forces. 
The Lagrangian formalism is based on the potential energy of the system, whereas the 
Newtonian approach is based on the forces acting on the system. 

To illustrate the utility of the Lagrangian approach, we shall redo Example 3, the 
two-dimensional motion of a particle in a coulombic force field. 

Example 3'. The kinetic energy is 


K = ™(.x 2 + y 2 ) = ™(r 2 + r 2 & 2 ) 


and the potential energy is U = — K’r . The Lagrangian, then, is 
Ur, e, r,&) = y (r 2 + r 2 6 2 ) + y 


(1-18) 
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The two Lagrangian equations of motion are 

d(dL\_dL 
dt \ dr) dr 

d_ /8L\ __ dL 
It\d6) ~~86 

or using Eq. (1-18) for L, 

d, . s . 7 K 
dt ' ' r 2 

4(mr 2 6 ) = 0 
dt ' 

These two equations are just Eqs. (1-11) and (1-13). Note, however, that they were 
obtained in a much more straightforward manner than were Eqs. (1-11) and (1-13). 
Problems 1-10 through 1-12 further illustrate the utility of the Lagrangian formula- 
tion. Other problems involve the motion of one and two particles in central force fields. 

Equations (1-17) are three second-order ordinary differential equations. To com- 
pletely specify the solutions, we need three initial velocities ^(0), q 2 (0), g 3 (0), and three 
initial positions #i(0), q 2 (0)> # 3 (0). These six initial conditions along with Lagrange’s 
equations completely determine the future (and past) trajectory of the system. If there 
were N particles in the system, there would be 3 N Lagrange equations and 6 N initial 
conditions. 

There is another formulation of classical mechanics that involves 6 N first-order 
differential equations. Although this formulation is not as convenient as Lagrange’s 
for solving problems, it is more convenient from a theoretical point of view, partic- 
ularly in quantum mechanics and statistical mechanics. This is the Hamiltonian 
formulation. 


HAMILTONIAN APPROACH 

We define a generalized momentum by 



j = 1,2,..., 3N 


(1-19) 


This generalized momentum is said to be conjugate to qj . Note that Eq. (1-19) is 
simply p x — mx, and so on, in Cartesian coordinates. 

We now define the Hamiltonian function for a system containing just one particle 
(for simplicity) by 

3 

ff(Pu Pi, P3, <h, q 2 ,q 3 ) = £ Pj Qj - L(4u Qi,Q 3 , Qi, Qi , Qi) (1-20) 

j= 1 


It is understood here that the qfs have been eliminated in favor of the p/s by means 
of Eq. (1-19). 

An important difference between the Lagrangian approach and the Hamiltonian 
approach is that the Lagrangian is considered to be a function of the generalized 
velocities q } and the generalized coordinates q j9 whereas the Hamiltonian is con- 
sidered to be a function of the generalized momenta p } and the conjugate generalized 
coordinates qj . This may appear to be a fine distinction at this point, but it will turn 
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out to be important later on. It also may seem, at this time, that the definition 
Eq. (1-20) is rather obscure, but we shall give a motivation for its form in Section 1-4. 
(See Problem 1-38.) 

For the kinds of systems that we shall treat in this book, the kinetic energy is of the 
form 


3 N 

K = X a j{ ( }\'> <72 > • • • * fo/v)#/ (1-21) 

j = i 

that is, a quadratic function of the generalized velocities. The coefficients a } are, in 
general, functions of generalized coordinates but not an explicit function of time. If, 
furthermore, the potential energy is a function only of the generalized coordinates, 
then the p y occurring in Eq. (1-20) are given by 


dL dK 

Pj p ■ n • Qj 

dqj dqj 


where the last equality comes from Eq. (1-21). Substituting this into Eq. (1-20) gives 
the important result 


H = K + U = total energy 


( 1 - 22 ) 


We shall now show that if the Lagrangian is not an explicit function of time, then 
dH,dt = 0. We begin with the definition of //, that is, Eq. (1-20). 


dH = Y qj d Pj + £ Pj dqj -'£ — dq j -'Z — dqj 

But if we use Eqs. (1-17) and (1-19), we see that 

dH=Y. <?; dpj ~ £ Pj dqj ( 1 - 23 ) 

The total derivative of H is (assuming no explicit dependence on time) 


dH — 




( 1 - 24 ) 


Comparing Eqs. (1-23) and (1-24), we get Hamilton’s equations of motion: 


dH 

dPj 


= <lj 



J = 1> 2, . . . , 3N 


( 1 - 25 ) 


Hamilton’s equations are 6 N first-order differential equations. It is easy to show from 
Eqs. (1-24) and (1-25) that dH/dt = 0. (See Problem 1-14.) This along with Eq. (1-22) 
says that energy is conserved in such systems. 

Since the Hamiltonian is so closely related to the energy, and it is the total energy 
which is usually the prime quantity in quantum and statistical mechanics, the Hamil- 
tonian formalism will turn out to be the most useful from a conceptual point of view. 
Fortunately, however, we shall never have to solve the equations of motion for macro- 
scopic systems. The role of statistical mechanics is to avoid doing just that. 


1-3 QUANTUM MECHANICS 

In the previous section we have seen that a knowledge of the initial velocities and 
coordinates of a particle or a system of particles was sufficient to determine the future 
course of the system if the equations of motion, essentially the potential field that the 
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system experiences, are known. If the state (its velocities and coordinates) of the 
system is known at time t 0 , then classical mechanics provides us with a method of 
calculating the state of the system at any other time t x . 

By the 1920s it was realized that such a calculation was too detailed in principle. 
The Heisenberg uncertainty principle states that it is impossible to precisely specify 
both the momentum and position of a particle simultaneously. Consequently, the 
prescription given by classical mechanics had to be modified to include the principle of 
uncertainty. This modification resulted in the development of quantum mechanics. 

There are a number of levels of introducing the central ideas of quantum mechanics, 
but for most of the material in this text, we need consider only the most elementary. 
A fundamental concept of quantum mechanics is the so-called wave function ^(q, t) 9 
where q represents the set of coordinates necessary to describe the system. The wave 
function is given the physical interpretation that the probability that at time t the 
system is found between q x and q Y + dq u q 2 and q 2 + dq 2 , and so on, is 

We shall often write dq x • • • dq 3N as dq. The uncertainty principle dictates that ^(q, t) 
is the most complete description of the system that can be obtained. Since the system 
is sure to be somewhere, we have 


JV(q,fmq, t)dq = 1 (1-26) 

If Eq. (1-26) is satisfied, ¥ is said to be normalized. 

A central problem of quantum mechanics is the calculation of ^(q, t) for any 
system of interest. We denote the time-independent part of ^(q, t) by ^(q). The state 
of the system described by a particular ^(q) is said to be a stationary state. Through- 
out this book we shall deal with stationary states only. 

For our purpose, the wave function ip is given as the solution of the Schrodinger 
equation 

34?\p = Eip (1-27) 

where 34? is the Hamiltonian operator, and £ is a scalar quantity corresponding to the 
energy of the system. The Hamiltonian operator is 


h 2 


2m 





2m 


V 2 


+ 


U(x 9 y, z) 


(1-28) 


where h is h f 2n 9 that is, Planck’s constant divided by 2n. The first term here corre- 
sponds to the kinetic energy, and the second term is the potential energy. The Hamil- 
tonian operator, then, corresponds to the total energy. There is a quantum mechanical 
operator and an equation similar to Eq. (1-27) corresponding to every quantity of 
classical mechanics, but we shall need only the one for the energy, namely, the 
Schrodinger equation. 

Given certain physical boundary conditions of the system, a knowledge of 34? alone 
is sufficient to determine \p and E. The wave function ip is called an eigenfunction of 
the operator 34 ? , and E is called an eigenvalue. There will usually be many ip’s and 
£’s that satisfy Eq. (1-28), and this is indicated by labeling \p and E with one or more 
subscripts. Generally, then, we have 

^j = Ejilij 


(1-29) 
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Equation (1-29) is a partial differential equation for The application of the 
boundary conditions often limits the values of Ej to only certain discrete values. Some 
simple examples are 

1 . a particle in a one-dimensional infinite well : 


h 2 d 2 


2m dx 


2 


h 2 n 2 




n = 1, 2, ... 


ima 2 

2. a simple harmonic oscillator: 

h 2 d 2 1 2 

2te=- — — 2 + r kx 1 
2 m dx 2 

e u .= (n + $)hco n = 0,1,2,... 

where to = (k:m) 1/2 . 

3. a rigid rotor (see Problem 1-21): 

li 2 | 1 a/. 1 d 2 \ 

x --2j(5^sr e ») + ii?ea?) 

jtj + in 2 


£ i — 


21 


J = 0, 1, 2, 


( 1 - 30 ) 


( 1 - 31 ) 


( 1 - 32 ) 


Here I is the moment of inertia of the rotor (see Problem 1-15 for a treatment of the 
classical counterpart of this system). 

The rigid rotor illustrates another important concept of quantum mechanics, 
namely, that of degeneracy. It happens that there may be a number of eigenfunctions 
or states of the system having the same eigenvalue or energy. The number of eigen- 
functions having this energy is called the degeneracy of the system. For the rigid rotor, 
the degeneracy, co j9 is 2 J + 1. The particle in a one-dimensional infinite well and the 
simple harmonic oscillator are nondegenerate, that is, the (D n are unity. The concept 
of energy states and degeneracy plays an important role in statistical thermodynamics. 

Consider the energy states of a particle in a three-dimensional infinite well. These 
are given by 

h 2 

= g ^3 W + n 2 + n 2 ) n x , n y , n z = 1 , 2, 3, . . . (1-33) 


The degeneracy is given by the number of ways that the integer M = 8ma 2 e'h 2 can 
be written as the sum of the squares of three positive integers. In general, this is an 
erratic and discontinuous function of M (the number of ways will be zero for many 
values of M), but it becomes smooth for large M, and it is possible to derive a simple 
expression for it. Consider a three-dimensional space spanned by n X9 n y9 and n z . There 
is a one-to-one correspondence between energy states given by Eq. (1-33) and the points 
in this n x9 n y9 n z space with coordinates given by positive integers. Figure 1-1 shows a 
two-dimensional version of this space. Equation (1-33) is an equation for a sphere of 
radius R = (8 ma 2 e/A 2 ) 1/2 in this space 


n 2 + n 2 + n 2 


8 ma 2 e 

~1F~ 


= R 2 
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Figure 1-1. A two-dimensional version of the (n Xy n y , n z ) space, the space with the quantum numbers 
n X9 riy, and n z as axes. 


We wish to calculate the number of lattice points that are at some fixed distance from 
the origin in this space. In general, this is very difficult, but for large R we can proceed 
as follows. We treat R or e as a continuous variable and ask for the number of lattice 
points between e and e + Ae. To calculate this quantity, it is convenient to first calculate 
the number of lattice points consistent with an energy <e. For large e, it is an excellent 
approximation to equate the number of lattice points consistent with an energy <e 
with the volume of one octant of a sphere of radius R. We take only one octant, 
because n x9 n y9 and n z are restricted to be positive integers. If we denote the number 
of such states by <X>(e), we can write 


_ 1 /4irR 3 \ n/8ma 2 e\ 312 

HbH 


(1-34) 


The number of states between e and e + Ae (Ae'e 4. 1) is 


co(e, Ae) = 0(e + Ae) — O(e) 

0-36) 

If we take e = 3kTj2 9 T = 300°K, m = 10" 22 g, a = 10 cm, and Ae to be O.Ole (in 
other words a 1 % band around e), then co(e, Ae) is O(l0 28 ).* So even for a system 
as simple as a particle-in-a-box, the degeneracy can be very large at room tem- 
perature. 

For an N-particle system, the degeneracy is tremendously greater than 0(1 0 28 ). 
To see this, consider a system of N noninteracting particles in a cube. The energy of 
this system is 


N 


3 N 


E = 5372 I ("*/ + ”yj 2 + «*/) = 5—2 E S J 


8ma j=i 


8ma 2 j= i 


where n xj9 n yJ , n zj9 and Sj are positive integers. The degeneracy of this system can be 
calculated by generalizing the above derivation for one particle. Using the volume of 


* We use the notation O(10 28 ) say, to mean of the order of magnitude 10 28 . This differs from standard 
mathematical notation, but there should be no confusion. 
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an N-dimensional sphere from Problem 1-24, the number of states with energy <£is* 

1 / 2nma 2 E\ 3N 12 

= r (N + l)r[(3N'2) + 1] ( h 2 ) 0-3 ] 

where T(«) here is the gamma function. (See Problem 1-58.) The number of states 
between E and E + A E is 

1 / 2nma 2 \ 3N/ 2 ( ^ Nn M 

In this case, E = 3NkTl2. If we take T = 300°K, m = \0~ 22 g, a = 10 cm, 
N = 6.02 x 10 23 , and A E equal to 0.01£, we get Q(E, A E) to be O(10 N ) (see Problem 
1-23), an extremely large number. This shows that as the number of particles in the 
system increases, the quantum mechanical degeneracy becomes enormous. Although 
we have shown this only for a system of noninteracting particles confined to a cubical 
box, that is, an ideal gas, the result is generally true. We shall see in the next chapter 
that the concept of the degeneracy of a macroscopic system is very important. 

There is another quantum mechanical result that we shall use later on. It often 
happens that the Hamiltonian of a many-body system can be written either exactly or 
approximately as a summation of one-particle or few-particle Hamiltonians, that is, 

& = #e a + s/e p + + * • • 0 - 38 ) 

Let the eigenvalues of } be , and the eigenfunctions be ipj , where j = a, p, y, . . . 
To solve the many-body Schrodinger equation, we let ^ • • • • Then 


p + y 4 - 

= ••• JP’a'I'a + tf'atf'y + mmm 

= 'l'p'l'yta'l'a+ 'l'a'l'ytp'l'p+ 

= (e a + e fi + • • • “ = Eip (1-39) 

In other words, the energy of the entire system is the sum of the energies of the indi- 
vidual particles if they do not interact. This is a very important result and will allow 
us to reduce a many-body problem to a one-body problem if the interactions are weak 
enough to ignore, such as in the case of a dilute gas. We shall see a number of cases 
where, even though the interactions are too strong to be ignored (such as in a solid), it 
is possible to formally or mathematically write the Hamiltonian in the form of 
Eq. (1-38). This will lead to defining quasi-particles like phonons and photons. 

The last quantum mechanical topic we shall discuss here is that of the symmetry of 
wave functions with respect to the interchange of identical particles. Consider a system 
of N identical particles, described by a wave function 2, 3, ..., N) 9 where 1 
denotes the coordinates of particle 1, and so on. If we interchange the position of any 
two of the particles, say particles 1 and 2, the wave function must either remain the 
same or change sign. (See Problem 1-26.) Thus if we let P l2 be an operator that 
exchanges the two identical particles 1 and 2, then 


P 12 ^(l,2, 3, . . . , N) = 1, 3, ...,2V) 

= ±^(1,2, 3 ,...,N) 


(1-40) 


* The extra factor of T(N+ 1) occurs here because of the indistinguishability of the N particles. This will 
be discussed fully in Chapter 4. 
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It turns out that whether the wave function remains the same or changes sign is a 
function of the nature of the two identical particles that are exchanged. For particles 
with an integral spin (such as the He-4 nucleus, photons, . . .), the wave function 
remains the same. In this case the wave function is called symmetric, and such particles 
are called bosons. For particles with half-integral spin (such as electrons, ...), the 
wave function is called antisymmetric, and the particles are called fermions. Chapter 
4 considers the consequences of this symmetry requirement of wave functions. 


1-4 THERMODYNAMICS 

In this section we shall not attempt to review thermodynamics, but shall simply state 
the three laws and briefly discuss their consequences. Problems 1-27 through 1-36 
review some of the equations and manipulations that arise in thermodynamics. Two 
topics that are not often treated in elementary physical chemistry are presented here, 
namely, Legendre transformations and Euler’s theorem. Both of these topics will be 
used later on. 

The pressure-volume work done by a system on its surroundings in going from 
state A to state B is 



where p is the pressure exerted by the surroundings on the system. The differential 
quantity 8w is positive if dV is positive. 

The heat absorbed by the system from the surroundings during the change of the 
system from state A to state B is 

r B 

q = I Sq 

The first law of thermodynamics states that even though w and q depend upon the 
path taken from A to B, their difference does not. Their difference, then, is a function 
only of the two states A and B 9 or, namely, is a state function. This function is called 
the internal energy or thermodynamic energy and is denoted by E. 

The first law of thermodynamics is 

A E = E B — E A = q — w 

B B 

= J dq- J pdV (1-41) 

For simplicity, we consider only p~V work. 

A reversible change is one in which the driving force (a difference in pressure, a 
difference in temperature, and so on) is infinitesimal. Any other change is called 
irreversible or spontaneous. Problem 1-27 asks the reader to show that for an iso- 
thermal process, w rcv > w irrev and q TCV > q 1Trcy . 

The first law of thermodynamics is nothing but a statement of the law of conserva- 
tion of energy. The second law is somewhat more abstract and can be stated in a 
number of equivalent ways. One of them is: There is a quantity S 9 called entropy, which 
is a state function. In an irreversible process, the entropy of the system and its surround- 
ings increases. In a reversible process, the entropy of the system and its surroundings 
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remains constant. The entropy of the system and its surroundings never decreases. The 
system and its surroundings are often referred to as the universe. 

The mathematical expression for the difference in entropy between states A and B 
of a system is given by 



Note that the heat appearing here is that associated with a reversible process. To 
compute AS between two states A and B y we must take the system from A to B in a 
reversible manner. 

Another statement of the second law is : Along any reversible path, there exists an 
integrating factor T, common to all systems such that 

dS = (1-43) 

is an exact differential, that is, that S is a state function. Thus 



For all other processes 



where T is the temperature of the surroundings. 

The third law of thermodynamics states: If the entropy of each element in some 
crystalline state be taken as zero at the absolute zero of temperature, every substance 
has a finite positive entropy, but at the absolute zero of temperature, the entropy may 
become zero, and does become so in the case of perfect crystalline substances. 

The second law is concerned with only the difference in the entropy between two 
states. The third law allows us to calculate the absolute entropy of a substance by 
means of the expressions 

S - S 0 = f and S 0 = 0 (1-44) 

Jo T 


Problem 1-36 asks you to calculate the absolute entropy of gaseous nitromethane at its 
boiling point. 

For simple one-component systems, the first law can be written in the form 
dE = T dS - pdV (1-45) 

This implies that 

(i),.- 7 ' (HX “ _jP <1 - 46> 

The simplicity of these partial derivatives implies that E is a “ natural ” function of S 
and V . For example, if we were to consider E to be a function of V and T (see 
Problem 1-30), we would get 

dE -[ T (w) r - p } d, ' +c - JT 
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(a) (b) 

Figure 1-2. (a) shows the function ^(x) and (b) shows a family of functions, all of which give the same 
value of y for any fixed value of p. 

Note that in this case the coefficients of dV and dT are not as simple as the coefficients 
of dV and dS obtained when E is expressed as a function of S and V. The 44 simplicity ” 
of the expression dE = T dS — pdV suggests that S and V are the 44 natural ” variables 
for E. The quantities S and V (especially S) are difficult to control in the laboratory 
and consequently are not always the most desirable independent variables. A more 
useful pair might be (T, V) or (T, p). An important question that arises, then, is the 
existence of other thermodynamic state functions whose natural variables are (T, V) 
or (T, p), and so on. Furthermore, how would one find them if they do exist. This 
leads us to the topic of Legendre transformations. 

We shall discuss a function of one variable in some detail and then simply present 
the generalization to a function of many variables. Consider a function y = y(x), and 
let its slope be p = p(x). We wish to describe the function y(x) in terms of its slope. 
Figure 1-2, however, shows that the slope alone is not sufficient to completely specify 
y(x). Figure l-2(a) shows the curve y(x), and Fig. l-2(b) shows a family of curves, all 
of which give the same value of y for any one value of p. In order to uniquely describe 
the curve in Fig. l-2(a), we must select one member of the family of curves in 
Fig. 1— 2(b). We do this by specifying the intercepts of the tangent lines with the y-axis. 
Let the intercept be $(p). Instead of describing the curve in Fig. l-2(a) by y versus x, 
then, we can equally well represent it by specifying the slope at each point along with 
the intercept of the slope with the y-axis. Figure 1-3 shows these two representations. 
One sees that either representation can be used to describe the function. The relation 
between the two representations can be obtained by referring to Fig. 1-4. This figure 
shows that the slope p at any point is given by 

P X-0 

The result that we are after is 

<I>(p)=y-px (1-47) 

The function $(p) is the Legendre transformation of y. It is completely equivalent to 
y(x), but considers p to be the independent variable instead of x. This may not be 
clear from the notation in Eq. (1-47), but it is understood there that y and x have been 
eliminated in favor of p by using the equations y = y(x) and x = x(p ). 

Let us apply this to the thermodynamic energy E(S, V). We seek a function of 
T and V that is completely equivalent to E. Equation (1-46) shows that T = (dE/dS ) v , 



(a) r-.v representation (b) <p-p representation 

Figure 1-3. In (a), the function is represented by the locus of points, y versus x. In (b), the same function 
is given by the envelope of its tangent curves. 

and so we are in a position to apply Eq. (1-47) directly. This can be treated as a one- 
variable problem, since V is held fixed throughout. Therefore the Legendre transfor- 
mation of E that considers T and V to be the independent variables is E — TS. Of 
course, this is the Helmholtz free energy 


A(T, V) = E — TS 

(1-48) 

whose differential form is 


dA = -SJT-pdV 

(1-49) 


This shows that the natural variables of A are T and V. Another motivation for saying 
this is that the condition for equilibrium at constant T and V is that A assume its 
minimum value, or that A A ^ 0 for a spontaneous process at constant T and V To 
prove this, write 

dA = dE—TdS — S dT 

= 6q-pdV-6q rey -SdT 

= Sq- <5$ rev (1 — 50) 



Figure 1-4. The diagram used to derive the connection between the y-x representation and the <j>-p 
representation. 
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at constant T and V But Sq < 8q rey (see Problem 1-27), and so AA < 0 at constant 
Tand V 

In elementary physical chemistry, the function A = E — TS is often presented as 
an a priori definition. But it should be apparent now that this form is dictated by the 
Legendre transformation if one specifies Tand Kto be the independent variables. 

A function, whose natural variables are S and p, can be obtained in the same 
manner. Equation (1-46) shows that p = — (dEidV) s , so Eq. (1-47) gives that E + pV 
is a thermodynamic state function, whose natural variables are S and p. This function 
is, of course, the enthalpy. 

The generalization of Eq. (1-47) to more than one variable is simply 

4>(p)=y-Y,PjXj (i-5i) 

j 

where the x/s are the independent variables of y, and pj = (dyjdxj). We can use 
Eq. (1-51) to construct a thermodynamic state function, whose natural variables are 
T and p. Using Eqs. (1-46) and Eq. (1-51), we see that such a function is E — TS + pV 9 
the Gibb’s free energy. Its differential form dG = — SdT-\- V dp and the fact that 
A G < 0 for a spontaneous change at constant T and p suggest that T and p are the 
natural variables of G. 

Up to this point we have considered only closed one-component systems. In general, 
E,H, A 9 and G depend upon the number of moles or molecules of each component. If 
we let Nj be the number of moles of component j, we have 


dE = T dS-pdV + 



S, V, Nk, j*k 


dNj 


= TdS-pdV +Y,n J dN J 

j 


(1-52) 

(1-53) 


where the second line defines pj . By adding d(pV) to both sides of Eq. (1-53), we get 


dH = T dS + V dp + £ pj dN } 

j 

If we subtract d(TS ) from both sides of Eq. (1-53), we get 

dA = -S dT - p dV + YPjdNj 

J 

Similar manipulations give 

dG= -SdT+ Vdp + ^pjdNj 

j 

Equations (1-52) through (1-56) show that 

_ ( dE \ - ( dH \ - ( dA \ - ( dG \ 

J W j) Wv s, p, ... W j) V, T, ... W j) p, T, 


(1 — 54) 


(1-55) 


(1-56) 


(1—57) 


The quantity fij is called the chemical potential. 

There is a mathematical theorem, called Euler’s theorem, which is very useful in 
thermodynamics. Before discussing Euler’s theorem, however, we must define exten- 
sive and intensive variables. Extensive properties are additive; their value for the 
whole system is equal to the sum of their values for the individual parts. Examples are 
the volume, mass, and entropy. Intensive properties are not additive. Examples are 
temperature and pressure. The temperature of any small part of a system in equili- 
brium is the same as the temperature of the whole system. Euler’s theorem deals with 
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extensive and intensive variables. If 


f(Ax u Ax 2 , . . . , Xx n ) = A n f(x u x 2 , . . • , x N ) ( 1 - 58 ) 

/ is said to be a homogeneous function of order n. The functions /(x) = 3x 2 and 
f(x, y, z) = xy 2 + z 3 — 6x*/y are homogeneous functions of degree 2 and 3, respec- 
tively, whereas /(x) = x 2 + 2x - 3 and /(x, y) = xy — e xy are not homogeneous. 
Euler’s theorem states that if f(x u . . . , x N ) is a homogeneous function of order n, then 


x df df df 

" fix ' Xx) - x 'er, + x ’e^ + '" + x "d^ 


( 1 - 59 ) 


The proof of Euler’s theorem is simple. Differentiate Eq. (1-58) with respect to X: 


nX” i f(x u x 2 . 



Euler’s theorem is proved by letting X = 1. 

Extensive thermodynamic variables are homogeneous of degree 1. Let us apply 
Euler’s theorem to the Gibb’s free energy. 


G(T, p, XN U XN 2 ,...) = XG(T, p,N 1 ,N 2 ,...) 


The variables T and p here can be treated as constants. Equation (1-59) gives that 




Taking the derivative of this at constant T and p, 


( 1 - 60 ) 


dG = £ Nj dfij + £ fij dNj (constant T and p) 
j J 

But using Eq. (1-56) at constant Tand p, we have 

£ Nj dpj = 0 (constant T and p) (1 -61 ) 

This is called the Gibbs-Duhem equation and is very useful in the thermodynamic 
study of solutions. (See Physical Chemistry , 4th ed., by W. J. Moore, p. 235, under 
“Additional Reading,” for a simple application of the Gibbs-Duhem equation.) 

We shall conclude this section on thermodynamics with a brief discussion of the 
application of thermodynamics to chemical equilibria. Consider the general reaction 


v a A + v b B + • • • ^ v d D + v e E + • • • (1-62) 

The capital letters represent the formulas of the compounds, and the v y - represent 
stoichiometric coefficients. It is more convenient to write Eq. (1-62) mathematically as 

v d D + v e E + • • • — v a A — v b B — • • • = 0 (1-63) 

Define the extent of reaction 2, such that dNj = Vj dX for all j, where the v’s for pro- 
ducts are positive, and those for reactants are negative. 

At constant T and p, we have 

dG = £ fij dNj = ^ fij dX (constant T and p) 
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At equilibrium, G must be a minimum with respect to A, so we write 

£ Pi Vj = VdUd + v e Pe +■■■ ~ v a Pa - VbVb 0 l 1 - 64 ) 

j 

at equilibrium. The equilibrium between phases can be considered to be a chemical 
reaction of the form A ^ B, and so Eq. (1-64) gives that \i A = p B from the equilibrium 
condition between two pure phases. 

Now consider the application of Eq. (1-64) to a chemical reaction between gases 
dilute enough to be considered ideal. Let the reaction be v a A + v b B^v c C 4 - v d D. 
At constant temperature, 

dG = V dp (constant T) 

and so 

r p r p NkT v 

G — G° = I V dp =\ dp = NkT In — ( 1 - 65 ) 

Jpo J Po P Po 

In this equation G° is the standard free energy of the gas, the standard state being the 
gas at a pressure p 0 . Usually p 0 is taken to be 1 atmosphere. If we take N to be 1 
mole, then G and G° become p and p°. Each component in the reactive gas mixture 
will have an equation of the form of Eq. (1-65), and so we have 


»j(T, p) = + RT ln£- 

Poj 


The total free energy change is 


= v c fi c + v D fi D - v Af i A - v B pg 


= Ap° + RT In 


(pcy c (p D y° 

(PxT"(PbT b 


( 1 - 66 ) 


(1-67) 


In this equation the (p')’s are plp 0 , that is, they are the pressures relative to the 
standard states. These (p')’ s are unitless. The argument of the logarithm here has the 
form of an equilibrium constant, but is not equal to the equilibrium constant unless 
the pressures are those which exist at chemical equilibrium. Equation (1-67) gives the 
change in free energy of the conversion of reactants at arbitrary pressures to products 
at arbitrary pressures. 

At equilibrium, Ap = 0, and we have 


Ap° = -RT In 


(Pc7 c (Pd7 p 

iPA'r A (PB 7 s 


equilibrium 


= -RTlnK p 


( 1 - 68 ) 


There are extensive tabulations of ^ 0, s, and so Ap° is a simple matter to calculate. We. 
see that if A fi° < 0, then K p > 1, that is, the conversion of reactants in their standard 
states to products in their standard states proceeds spontaneously. On the other hand, 
if Ap° > 0, then K p < 1, and we conclude that the reaction does not proceed spon- 
taneously. It should be understood, however, that this applies only to reactants and 
products in their standard states. In general, it is Ap along with Eq. (1-67) that deter- 
mines the extent of a chemical reaction. (See Problem 1-34.) 
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1-5 MATHEMATICS 

In this section we shall discuss several mathematical techniques or results that are 
repeatedly used in statistical thermodynamics. The topics we shall discuss here are 
random variables and distribution functions, Stirling’s approximation, the binomial 
and multinomial coefficients, the Lagrange method of undetermined multipliers, and 
the behavior of binomial and multinomial coefficients for large numbers. 


PROBABILITY DISTRIBUTIONS 

Let u be a variable which can assume the M discrete values u u u 2 , . . . , u M with 
corresponding probabilities p(«i), p(u 2 ), . . . , p(u M ). The variable u is said to be a 
discrete random variable, and p(u) is said to be a discrete distribution. The mean, or 
average, value of u is 

- Tj= 1 u jP( u j) 

IZLiM 

Since p(uj ) is a probability, p(uj) must be normalized, that is, the summation in the 
denominator must equal unity. The mean of any function of «,/(«), is given by 

m = Yf(Uj)p(uj) (1-69) 

J=1 


If /(«) = u m ,f(u) is called the /wth moment of the distribution p(u). If /(«) = (u — u) m , 
f(u ) is called the with central moment, that is, the with moment about the mean. In 
particular, the mean of (u — u) 2 is called the variance, and is a measure of the spread 
of the distribution. The square root of the variance is the standard deviation. 

A very commonly occurring and useful discrete distribution is the Poisson distribu- 
tion: 


P(m) = 


a m e~ a 

ml 


m = 0, I, 2, ... 


(1-70) 


This distribution has been applied to shot noise in electron tubes, the distribution of 
galaxies in space, aerial search, and many others.* (See Problem 1-42.) 

If the random variable U is continuous rather than discrete, then we interpret 
p(u) du as the probability that the random variable U lies between the values u and 
u + du. The mean of any function of U is 


/(«) = J f(M)p{u) du 


(1-71) 


The limits of the integral are over the entire range of U. 

The most important continuous probability distribution is the Gaussian distribu- 
tion: 


p(x) = 


(bio 2 ) 112 CXP { 


(*-*) 

2a 2 


— 00 < X < 00 


(1-72) 


The quantity a 2 , which is the variance, controls the width of the Gaussian distribution. 
The smaller the a, the narrower the Gaussian distribution becomes. In the limit a -► 0, 
Eq. (1-72) becomes a delta function (this is one representation of a delta function of 
Appendix B). Problems 1-43 through 1-45 involve some important results based on 
Eq. (1-72). 


* See Modem Probability Theory and Its Applications by E. Parzen (New York : Wiley, 1960). 
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STIRLING'S APPROXIMATION 

In statistical thermodynamics we often encounter factorials of very large numbers, 
such as Avogadro’s number. The calculation and mathematical manipulation of 
factorials become awkward for large N. Therefore it is desirable to find an approxima- 
tion for N\ for large N. Problems of this sort occur often in mathematics and are 
called asymptotic approximations, that is, an approximation to a function which 
improves as the argument of that function increases. Since N! is actually a product, it 
is convenient to deal with In N\ because this is a sum. The asymptotic approximation 
to lnN! is called Stirling’s approximation, which we now derive. 

Since N\ = N(N - 1 )(N - 2) ■ • • (2)(1), In N\ is 

N 

lnN!=£lnw (1-73) 

m= 1 


Figure 1-5 shows In x plotted versus x. The sum of the areas under these rectangles 
up to N is In N\. Figure 1-5 also shows the continuous curve In jc plotted on the same 
graph. Thus In x is seen to form an envelope to the rectangles, and this envelope 
becomes a steadily smoother approximation to the rectangles as x increases. We can 
approximate the area under these rectangles by the integral of In x. The area under 
In x will poorly approximate the rectangles only in the beginning. If N is large enough 
(we are deriving an asymptotic expansion), this area will make a negligible contribu- 
tion to the total area. We may write, then. 


N r N 

lnN\= Y, lnm« | In x dx = N InN — N 

m= 1 


(N large) 


(1-74) 


which is Stirling’s approximation to InN!. The lower limit could just as well have 
been taken as 0 in Eq. (1-74), since N is large. (Remember that x In x -> 0 as x -> 0.) 

A more refined derivation of Stirling’s approximation gives In N\ & N In N — N 
4- ln(27rN) 1/2 , but this additional term is seldom necessary. (See Problem 1-59.) 



Figure 1-5. A plot of In x versus x, showing how the summation of In m can be approximated by the 
integral of In x. 
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BINOMIAL AND MULTINOMIAL DISTRIBUTION 

During the course of our discussion of the canonical ensemble, we shall encounter 
the problem of determining how many ways it is possible to divide N distinguishable 
systems into groups such that there are n l systems in the first group, n 2 systems in 
the second group, and so on, and such that n l + n 2 + = N, that is, all the systems 

are accounted for. This is actually one of the easiest problems in combinatorial 
analysis. To solve this, we first calculate the number of permutations of N distin- j 
guishable objects, that is, the number of possible different arrangements or ways to 
order N distinguishable objects. Let us choose one of the N objects and place it in 
the first position, one of the N — 1 remaining objects and place it in the second posi- 
tion, and so on, until all N objects are ordered. Clearly there are N choices for the first 
position, N — 1 choices for the second position, and so on, until finally there is only 
one object left for the Nth position. The total number of ways of doing this is then the 
product of all the choices, 

N(N — 1 )(N — 2) • • • (2)(1) = N\ (distinguishable objects) 

Next we calculate the number of ways of dividing N distinguishable objects into 
two groups, one group containing N x objects, say, and the other containing the remain- 
ing N — N v There are N(N — 1) • • - (N — N x + 1) ways to form the first group, and 
N 2 ! = (N — N t )! ways to form the second group. The total number is, then, the 
product 

N(N - 1) • • • (N - N, + 1) x (N - N x ) ! = ™ , x (N - N t ) ! = N ! 

But this has overcounted the situation drastically, since the order in which we place 
N l members in the first group and N 2 in the second group is immaterial to the problem 
as stated. All N x \ orders of the first group and N 2 ! orders of the second group corre- 
spond to just one division of N objects into N x objects and N 2 objects. Therefore the 
desired result is 


N\ Nl 

N X \(N - N x )l ~ Ni!N 2 ! 


( 1 — 75 ) 


Since the combination of factorials in Eq. (1-75) occurs in the binomial expansion. 


(x + y) N = 


" N\x N ~ Ni y Nt 
N ho NJiN-NJl 


N\x Nt y Nl 
hi N,!N 2 ! 


( 1 - 76 ) 


N !; Nj \(N — Afj)! is called a binomial coefficient. The asterisk on the second summa- 
tion in Eq. (1-76) signifies the restriction N l + N 2 = N. 

The generalization of Eq. (1-75) to the division of N into r groups, the first con- 
taining N u and so on, is easily seen to be 


Nl 


Nl 


N t iN 2 i ••• N r i rc =1 7v 


( 1 - 77 ) 


where N l +N 2 + '" + N r = N. This is known as a multinomial coefficient, since it 
occurs in the expansion 


(x,-|-x 2 +"-+x r ) w = XX ^ n- AM 

JVi = 0N 2 =0 JV r = 0 lb=i^V 


£• N !x 1 w ‘-"X r w ' 


where this time the asterisk signifies the restriction N t + N 2 H + N r = N. 


( 1 - 78 ) 
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There are a number of other combinatorial formulas that are useful in statistical 
thermodynamics, but Eq. (1-77) is the most useful for our purposes. Combinatorial 
formulas can become rather demanding to derive. We refer to Appendix AV11 of 
Mayer and Mayer* which contains a collection of formulas. 


METHOD OF LAGRANGE MULTIPLIERS 

It will be necessary, later, to maximize Eq. (1-77) with the constraint 
Ni + N 2 + • • • + N r = constant. This brings us to the mathematical problem of 
maximizing a function of several (or many) variables f(x l9 x 2 , . x r ) when the 
variables are connected by other equations, say g x (x l9 . . . , x r ) = 0, g 2 (x l9 . . . , x r ) = 0, 
and so on. This type of problem is readily handled by the method of Lagrange un- 
determined multipliers. 

If it were not for the constraints, gj(x l9 x 2 , . . . , x r ) = 0, the maximum of f(x l9 . . . , x r ) 
would be given by 


Sf= 



Sxj = 0 


(1-79) 


where the zero subscript indicates that this equation equals zero only when the r 
partial derivatives are evaluated at the maximum (or minimum) of f Denote these 
values of Xj by Xj° If there were no constraints, each of the Sxj would be able to be 
varied independently and arbitrarily, and so we would conclude that (df/dxj) = 0 for 
every j 9 since df must equal zero. This would give r equations from which the values 
of the rxj° could be obtained. 

On the other hand, if there is some other relation between the x’s, such as 
g(x l9 x 2 , . . . , x r ) = 0, we have the additional equation 



This equation serves as a constraint that the Sxj must satisfy, thus making one of them 
depend upon the other r — 1. In the Lagrange method, one multiplies Eq. (1-80) by 
some parameter, say X, and adds the result to Eq. (1-79) to get 



(1-81) 


The Sxj are still not independent, because of Eq. (1-80), and so they cannot be varied 
independently. Equation (1-80), however, can be treated as an equation giving one 
of the 8xj in terms of the other r — 1 independent ones. Pick any one of the r 8xj 
as the dependent one. Let this be 8x M . 

The trick now is that we have not specified X yet. We set it equal to {dfjdx^j 
(dg/dxj o, making the coefficient of Sx^ in Eq. (1-81) vanish. The subscript zero 
here indicates that (df/dxj and (dg/dxj are to be evaluated at values of the Xj 
such that /is at its maximum (or minimum) under the constraint of Eq. (1-80). Of 
course, we do not know these values of Xj yet, but we can nevertheless formally define 
X in this manner. This leaves a sum of terms in Eq. (1-81) involving only the inde- 
pendent Sxj , which can be varied independently, yielding that 



j= 1,2,..., j/- 1 ,g.+ 1, . . . , r 


* See Mayer and Mayer, Statistical Mechanics (New York: Wiley, 1940). 
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If we combine these r— 1 equations with our choice for X, we have 



(1-82) 


for all j. 

As we said above, the choice of X here is certainly formal, since both (df/dx ^) 0 and 
(dg/dx^Q must be evaluated at these values of x } which maximizes /, but these are 
known from Eq. (1-82) only in terms of 2. But this presents no difficulty, since in 
practice X is determined by physical requirements. Examples of this will occur in the 
next two chapters. 

Lagrange’s method becomes no more difficult in the case in which there are several 
constraints. Let gi(x u . . . , x r ), g 2 (x l9 . . . , x r ), ... be a set of constraints. We introduce 
a Lagrange multiplier for each £ 2 (jc 19 . . . , x r ) and proceed as above to get 


Sf 3 d9i , Sg 2 

dxj A ' 8xj kl dxj 


(1-83) 


BINOMIAL DISTRIBUTION FOR LARGE NUMBERS 

Lastly, there is one other mathematical observation we need here in order to 
facilitate the discussion in the next chapter. This observation concerns the shape of 
the multinomial coefficient [Eq. (1-78)] as a function of the Nf s, as the N/s become 
very large. To simplify notation, we shall consider only the binomial coefficient, but 
this will not affect our conclusions. Let us first find the value of for which 
f(N x ) = Nl/N^iN — NJl reaches its maximum value. Since and N are both 
very large, we treat them as continuous variables. Also since In x is a monotonic 
function of x, we can maximize /(N^) by maximizing In /(Nj). This allows us to use 
Stirling’s approximation. The maximum of f(N x ) is found, then, from 

din AN,) 
dN x 

to be located at = N, f 2. Let us now expand In /(N^) about this point. The Taylor 

expansion is 

In AN,) = ln/(/V) + \ f !?/ ( 2 Nl) ) (Ni - Hi*) 2 + — O-M) 

l \ aiy 1 /N t =Ni* 

The linear term in N l — IV 2 * is missing, because the first derivative of In f(N) is zero 
at N t = N t *. The second derivative appearing in Eq. (1-84) is equal to —4 /IV. Thus 
if we ignore higher-order terms (see Problem 1-53), Eq. (1-84) can be written in the 
form of a Gaussian curve 

0 . 86) 

Comparison of this with the standard form of the Gaussian function 

/(X) = (Inc 2 ) 1 ' 2 CXP { “ la 2 ] } (1 " 86) 

shows that the standard deviation is of the order of N 1/2 . Equation (1-85) is, there- 
fore, a bell-shaped function, centered at N x * = N/2 and having a width of a few 
multiples of N ln . Problem 1-43 establishes the well-known fact that a Gaussian 
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function goes essentially to zero when x differs from x* by a few a’s. Since we are 
interested only in large values of N x (or N), say numbers of the order of 10 20 , we have 
a bell-shaped curve that is contained between 10 20 ± a few multiples of 10 10 , which, if 
plotted, would for all practical purposes look like a delta function centered at 
N x * = Nil. Thus we have shown that the binomial coefficient peaks very strongly at 
the point = N 2 = N 2. This same behavior occurs for a multinomial coefficient 
as well. If there are s Nf s, the multinomial coefficient has a very sharp maximum at the 
point N x = N 2 = * * * = N s = N is. (See Problem 1-50.) This peak becomes sharper as 
the Nf s become larger, and become a delta function in the limit Nj -»> oo for all j. 

MAXIMUM TERM METHOD 

Another important result, which is a consequence of the large numbers encountered 
in statistical mechanics, is the maximum-term method. It says that under appropriate 
conditions the logarithm of a summation is essentially equal to the logarithm of the 
maximum term in the summation. To see how this goes, consider the sum 

M 

S=£T W 

N= 1 

where T N >0 for all N. Since all the terms are positive, the value of S must be greater 
than the value of the largest term, say 7’ max , and less than the product of the number 
of terms and the value of the largest term. Thus we can write 

T„, ax <S <, MT^ 

Taking logarithms gives 

In T max <, In S < In + In M 

We shall see that it is often the case in statistical mechanics that r max will be 0(e M ). 
Thus we have 

0(A/) < In S < 0(M) + In M 

For large M, In M is negligible with respect to M itself, and so we see that In S is 
bounded from above and below by In , and so 

In S = In T max 

This is a rather remarkable theorem, and like a number of other theorems used in 
statistical mechanics, its validity results from the large numbers involved. 
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PROBLEMS 

1-1. Solve the equation of motion of a body of mass m dropped from a height h. Assume 
that there exists a viscous drag on the body that is proportional to and in the opposite direction 
to the velocity of the body. (Let the proportionality constant be y.) Solve for the so-called 
terminal velocity, that is, the limiting velocity as co. 

1-2. Calculate the trajectory of a shell shot out of a cannon with velocity v 0 , assuming no 
aerodynamic resistance and that the cannon makes an angle 0 with the horizontal axis. 

1-3. Remembering that the potential energy is given by 

V(x)=- f F(£)d£ = ikx* 

J o 

for a simple harmonic oscillator, derive an expression for the total energy as a function of 
time. Discuss how the kinetic and potential energy behave as a function of time. 

1-4. Solve the equation for a harmonic oscillator of mass m and force constant k that is 
driven by an external force of the form F(t) = F 0 cos co 0 1. 

1-5. Show that 

f (/) = A sin cot -j- B cos cot 

can be written as 

) = C sin(cof + <f>) 

1-6. Show that the total linear momentum is conserved for a system of N particles with an 
interaction potential which depends only on the distance between particles. 

1-7. When does p - BL/dq but f=dK/dql 

1-8. Consider a system of two-point particles with masses nit and m 2 moving in two 
dimensions. It is very common for their potential of interaction to depend upon their relative 
coordinates (xi — x 2 ,yi — y 2 ) only. Thus the total energy is 

mi m 2 

E = — {xi 2 + yi 2 ) + — ( x 2 2 + y 2 2 ) + U(x x -x 29 y t - y 2 ) 

Now introduce four new variables 

^ miXi + m 2 x 2 m x yi+m 2 y 2 

mi 4- m 2 Wi + m 2 

xi 2 =xi —x 2 y i2 =yi — yi 

and show that this two-body problem can be reduced to two one-body problems, one involv- 
ing the center of mass of the system and one involving the relative motion of the two particles. 
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Give a physical interpretation of the ratio nhm 2 l(nh + m 2 ) that arises naturally in the relative 
motion. What is this quantity called? This result is easily extended to three dimensions. 

1-9. Extend the development of Problem 1-8 to the case in which each particle also ex- 
periences an external potential energy, say U(x u yi, zi) and U(x 2 , y 2 , z 2 ). Interpret the result- 
ing equations. 

1-10. Derive Lagrange’s equations for a particle moving in two dimensions under a central 
potential u(r). Which of these equations illustrates the law of conservation of angular mo- 
mentum? Is angular momentum conserved if the potential depends upon 0 as well? 

1-11. For a particle moving in three dimensions under the influence of a spherically sym- 
metrical potential U = U(r), write down the Lagrangian and the equations of motion in 
spherical coordinates (r, 0 , <f>). Show that H = K + V from 

H^PiQi-L 

for this potential. 

1-12. Solve the equation of motion of two masses m i and m 2 connected by a harmonic 
spring with force constant k. 

1-13. Start with Lagrange’s equations in Cartesian coordinates, that is, 

d_ /a/A _dL 
dt \0*/ ” dx 

and so on. Now introduce three generalized coordinates qi, q 2 , and q 3 which are related 
to the Cartesian coordinates by x = x(q l9 q 2 , q 3 )> and so on. Show that by transforming 
Lagrange’s equations from x , x, y, y 9 z, and z as independent variables to q u Q 2 ,q 2 , q 2y 
and q 3 we get 



and so on. 

1-14. If H, the classical Hamiltonian, does not depend explicitly on time, show that 
dHjdt = 0 . What does this mean physically? Is this true if H does depend explicitly upon 
time? 

1-15. Consider the rotation of a diatomic molecule with a fixed intemuclear separation / 
and masses mi and m 2 . By employing center of mass and relative coordinates, show that the 
rotational kinetic energy can be written in spherical coordinates as 

U($ 2 + <f> 2 sin 2 0) 

and from this derive the rotational Hamiltonian 

*— 5(* , + 5Fs) 

In these equations, I == pi 2 , where p is the reduced mass. This Hamiltonian is useful for 
studying the rotation of diatomic molecules. 

1-16. Show that the motion of a particle under a central force law takes place entirely in 
a single plane. 

1-17. What is the expectation (average) value for the linear momentum p x of a particle in 
a one-dimensional box p x 2 ? Briefly discuss your results. 

1-18. Show that the energy eigenvalues of a free particle confined to a cube of length a 
are given by 

h 2 

e = - — - (n x 2 + rt y 2 + n x 2 ) n x , n y , n x = 1, 2, . . . 

8 ma 2 
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1-19. Show that the energy eigenvalues of a free particle confined to a rectangular paral- 
lepiped of lengths a 9 b, and c are given by 


e 


hL (”jL n JL 

8 m\a 2 + b 1 + c 2 J 


n x ,n y ,n T = l,2 9 ... 


1-20. Calculate the energy eigenvalues of a particle confined to a ring of radius a. 

1-21. Show that the Hamiltonian operator of a rigid rotor is given by Eq. (1-32). 

1-22. Calculate the degeneracy of the first few levels of a free particle confined to a cube of 
length a. 

1-23. Verify the calculation that follows Eq. (1-37) which shows that the quantum 
mechanical degeneracy of a macroscopic system is 0(10*). 

1-24. We need to know the volume of an TV-dimensional sphere in order to derive Eq. 
(1-36). This can be determined by the following device. Consider the integral 


J = 



j e ~ ( Jt l 2 +Jf 2 2 + *--+*N 2 ) 


dx i dx 2 * * • dx N 


First show that J 7 r* /2 . Now one can formally transform the volume element dx i dx 2 * * * dx N 
to TV-dimensional spherical (hyperspherical) coordinates to get 

dx i dx 2 * • • dx N -» r N ~ 1 S N dr 

"'angles 

where S N is the factor that arises upon integration over the angles. Show that S 2 — 2n and 
S 3 “ 4tt. S n can be determined for any TV by writing I in hyperspherical coordinates: 



Show that I = S N T(TV/2)/2, where T(x) is the gamma function (see Problem 1-58). Equate 
these two values for / to get 

2tt"' 2 

Sn - r(A72) 

Show that this reduces correctly for TV — 2 and 3. Lastly now, convince yourself that the 
volume of an TV-dimensional sphere of radius a is given by 



and show that this reduces correctly for TV = 2 and 3. 

1-25. Derive an expression for the density of translational quantum states for a two- 
dimensional ideal gas. 

1-26. Prove that a many-body wave function must be either symmetric or antisymmetric 
under the interchange of any two particles. Hint: Apply the exchange operation twice. 

1-27. Show for an isothermal process that w rev > w> !rre v and Qr ev ^ (7lrrev • 

1-28. Derive the thermodynamic equation 

hem 

and evaluate this difference for an ideal gas and a gas that obeys the van der Waals equation. 
1-29. Derive the thermodynamic equation of state 
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1-30. Derive the equation 

and from this show that ( dE/dV) T = a/V 2 for a van der Waals gas. 
1-31. Show that 



1-32. Derive an expression for d In K/d Tin terms of A H, the heat of reaction, and in terms 
of C p , the heat capacity at constant pressure. 

1-33. Consider the “water-gas” reaction 

C0 + H 2 0(g)^H 2 + C0 2 

where 


K _ Ph 2 Pco 2 
p PcoPh 2 o 

and given the following data: 


Substance 

(kcal/mole) 

a 

6 x10 s 

cx 10 7 

Ai/ 2 9 8 (kcal/m) 

CO 

-32.81 

6.42 

1.67 

1.96 

-26.4157 

H 2 0(g) 

-54.64 

7.26 

2.30 

2.83 

-57.7979 

co 2 

94.26 

6.21 

10.40 

-35.45 

-94.0518 

h 2 

0.00 

6.95 

-0.20 

4.81 

0.00 


where the heat capacity of the gases in cal deg -1 mole -1 is given by 
C p = a + bT+cT 2 


Calculate K p at 298°K and 800°K. 

1-34. Calculate the free energy change at 700°C for the conversion of carbon monoxide 
at 10 atm and water vapor at 5 atm to carbon dioxide and hydrogen at partial pressures of 
1 .5 atm each. The equilibrium constant K p for this reaction is 0.71 . Is this process theoretically 
feasible? 

1-35. It is illustrated in Chapter 17 that the speed of sound c 0 propagated through a 
gas is 

Co = {mpKs)~ in 

where k s is the adiabatic compressibility 



Show that this is equivalent to 


Co 



where y = C p /C v , and Mis the molecular weight of the gas. Using the above result, show that 


Co = 



1/2 


for an ideal gas. 
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1-36. Jones and Giauque obtained the following values for C p of nitromethane. 


°K 

15 

20 

30 

40 

50 





100 

Cp 

0.89 

2.07 

4.59 

6.90 

8.53 

9.76 


11.47 

12.10 

12.62 

°K 

120 

140 

160 

180 

200 

220 

240 

260 

280 

300 

Cp 

13.56 

14.45 

15.31 

16.19 

17.08 

17.98 

18.88 


25.17 

25.35 


The melting point is 244. 7°K, heat of fusion 2319 cal/mole. The vapor pressure of the liquid 
at 298. 1°K is 3.666 cm. The heat of vaporization at 298.0°K is 9147 cal/mole. Calculate the 
third-law entropy of CH 2 N0 2 gas at 298.1 °K and 1 atm pressure (assuming ideal gas behavior). 
1-37. Derive the Legendre transformation of E in which S T and TV-^/lx. 

1-38. Apply a Legendre transformation to the Lagrangian L{q Jy qj) to eliminate the 
generalized velocities in favor of generalized momenta, defined by p s = dL/dqj. What function 
does this turn out to be? 

1-39. Find the natural function of V y E y and \x. Hint: Start with the natural function of V> 
E y and TV, namely, S, and transform TV-*jx. 

1-40. Derive the Legendre transformation of E in which S^T y TV->^, and What 
peculiar thing happens when all the extensive variables are transformed out? 

1-41. Show that (* — x) 2 = x 2 — x 2 . 

1-42. Show that the Poisson distribution P(m) = a m e~ a lm\ is normalized. Calculate fn and 
the variance. What is the significance of the parameter al 

1-43. Sketch the Gaussian distribution as a (or even ojx) becomes smaller and smaller. 
To what type of distribution does a Gaussian go in the limit or->0. Discuss the meaning of 
this distribution. 

1-44. For the Gaussian distribution p(x) show that 

(a) 

r °° 

p(x) dx = 1 

J —oo 

(b) Calculate the «th central moment where n =0, 1,2, and 3. 

(c) In the limit a ->0 what kind of distribution is approached where 



1-45. The quantity (x — x) J is called the y'th central moment. Show that all odd central 
moments of a Gaussian vanish. What about the even ones? Relate the j = 2 central moment 
to the parameter a. 

1-46. Let f(x y y) be a joint probability density, that is, f(x y y) dx dy is the probability that 
X lies between x and x + dx and Y lies between y and y -1- dy. If X and Y are independent then 

fix , y) dx dy =fiix)f 2 (y) dx dy 

If X and Y are independent, show that the mean and variance of their sum is equal to the sum 
of the means and variances, respectively, of X and Y ; that is, show that if W = X+ Y y then 

W=X+Y 

(W— W) 2 =(X-X) 2 -KL-F 2 ) 

1-47. Let A" be a random variable on the positive numbers, 0 ^ x < oo, and let p(x) be its 
probability density function. The function <f>(s) defined by 

= f e~ sx p(x)dx 
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is called the characteristic function of p(x). Find the relation between <f>(s) and the moments 
of p(x). Is knowledge of all the moments of p(x) (assuming they exist) sufficient to specify 
p(x) itself? Why or why not? 

1-48. Show that the characteristic function of the density function of the sum of two 
independent random variables is the product of the characteristic functions of the densities 
of the two random variables themselves. What is the density function of W = X+ Y! 

1-49. Maximize 


W(N U N 29 ... 9 N m ) = 


with respect to each Nj under the constraints that 

^Nj = N = a fixed constant 
X Ej Nj = S = another fixed constant 


Hint: Consider the N/s to be continuous, large enough to use Stirling’s approximation of 
Nj !, and leave your answer in terms of the two undetermined multipliers. 

1-50. Show that the maximum of a multinomial distribution is given when Ni—N 2 = mmm 
N s = N/s. 

1-51. Use the method of undetermined multipliers to show that 

-£ PjlnPj 
j=i 

subject to the condition 

N 

£ Pj = i 
j= i 

is a maximum when Pj = constant. 

1-52. Consider the sum 


w Mix" 

N\(M-N)\ 

where x = 0(1), and M and N are O(10 20 ). First show that In X = M In (1 -j- *) exactly , and 
then calculate the logarithm of the maximum term. Hint: Remember the binomial expansion. 

1-53. Show that the higher terms that were dropped in the expansion of In f(N) in Eq. 
(1-84) are completely negligible for large values of N and M. 

1-54. The Planck blackbody distribution law 

h a* 3 d<o 

p(co, T)da>= 

rv tt 2 c 3 exp (fiftto) — 1 


gives the blackbody radiation energy density between frequencies w and w -1- dw. (ft = hj 2™, 
cd = 27 Tv y and e — hv = ftco.) Substitute this into 


E r 00 

v = j 0 ^ dw 

to derive the temperature dependence of El V. Do this by expressing your result as a group of 
factors multiplying a dimensionless integral. You do not need to evaluate this integral. 

1-55. Show that e x l(l ± e x ) 2 is an even function of x . 

1-56. The heat capacity of the Einstein model of a crystal is given by 


Cv = 3Nk 



e G *' T 


(e* E,T -l ) 2 
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where 0 £ is the “characteristic temperature” of the crystal. Determine both the high- and 
low-temperature limiting expressions for the heat capacity. Do the same thing for the Debye 
model of crystals, in which 




© D /r e x dx 


0 e x ~l ) 


where © D is the Debye temperature of the crystal. 

1-57. Recognizing it as a geometric series, sum the following series in closed form: 


S = £ *- 

n= 0 

Compare this result to 


/ = e~ an dn 
J o 

Under what conditions are these two results the same? 

1-58. One often encounters the gamma function in statistical thermodynamics. It was 
introduced by Euler as a function of x t which is continuous for positive values of x and which 
reduces to n\ when x — n, an integer. The gamma function T(x) is defined by 


T(x)= r e ‘t x ~ l dt 
J o 

First show by integrating by parts that 
T(x+l)=xT(x) 


Using this, show that T(« + 1) = n\ for n an integer. Show that 
r(i) =Vn 


Evaluate L(i) using the recurrence formula T(jc + 1) = xr(jt). Lastly show that 

(2 «-l) 


/ 1\ 1 • 3 • • ■ (2h — 1) /1\ 

r (”+2) r — r y 




2 2n n\ 


For a discussion of the gamma function, see G. Arf ken, Mathematical Methods for Physicists , 
2nd ed. (New York: Academic, 1970). 

1-59. We can derive Stirling’s approximation from an asymptotic approximation to the 
gamma function r(x). From the previous problem 


T(N+1) = N\=\ e-*x N dx 
J o 

f 00 

= e Na{x) dx 
J o 

where g(x) = In x — xjN. If g(x) possesses a maximum at some point, say x 0 , then for large 
N 9 exp (Ng(x)) will be extremely sharply peaked at x 0 . Under this condition, the integral for 
N\ will be dominated by the contribution of the integrand from the point x 0 . First show that 
g(x) does, in fact, possess at maximum at the point x 0 = N. Expand g{x) about this point, 
keeping terms only up to and including (x — N) 2 to get 


(*-A0 2 


+ **• 


g(x) « g(N) 


2 
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Why is there no linear term in (jc — AO? Substitute this expression for g(x) into the integral 
for N\ and derive the asymptotic formula 

In Nl » TV In N - N + ln(27r N) U2 

1-60. Verify the energy conversion factors in Appendix A. (The one labeled “ temperature ” 
means that temperature required to give an energy equal to kT \ where k is the Boltzmann 
constant.) 

1-61. An integral that appears often in statistical mechanics and particularly in the kinetic 
theory of gases is 



This integral can be readily generated from two basic integrals. For even values of n, we first 
consider 



The standard trick to evaluate this integral is to square it, and then transform the variables 
into polar coordinates. 


Io 2 



V 0 ” 2 dx dy 


»co «n/2 
^0 ^0 


e-^rdrdd 


7T 

4 a 



Using this result, show that for even n 


In 


1 


3-5 — (#i-l) 
2(2 aY' 2 



n even 


For odd values of w, the basic integral h is easy. Using l u show that 


L 



2^01 + 1)12 


n odd 


1-62. Show that a Gaussian distribution is extremely small beyond a few multiples of a . 
1-63. Another function that occurs frequently in statistical mechanics is the Riemann zeta 
function, defined by 


= £ *- 
k — 1 

First show that £(1) = oo, but that ((s) is finite for s > 1. Show that another definition of 

£(*) is 



34 INTRODUCTION AND REVIEW 


that is, show that this is identical to the first definition. In addition, show that 

ij»= £ (— I)‘- 1 A:-» = a -2 1 -*)£(*) 

1 

MA = 11 (2k + l)-‘ = (1 - 2~’)£(s) 

k = 0 

The evaluation of £($) for integral s can be done using Fourier series, and some results are 
£(2) = tt 2 /6 and £(4) = tt*/90. 

For a discussion of the Riemann zeta function, see G. Arfken, Mathematical Methods for 
Physicists , 2nd ed. (New York: Academic, 1970). 



CHAPTER 2 


THE CANONICAL 
ENSEMBLE 


In this chapter we shall introduce the basic concepts and assumptions of statistical 
thermodynamics, and then apply them to a system which has fixed values of V and N 
and is in thermal equilibrium with its environment. We shall derive the fundamental 
connection between the quantum mechanical energy levels available to an N-body 
system and its thermodynamic functions. This link is effected by a function, called the 
partition function, which is of central importance in statistical thermodynamics. In 
Section 2-4 we discuss the relevance of the statistical thermodynamic equations to 
the second and third laws of classical thermodynamics. 

2-1 ENSEMBLE AVERAGES 

Our goal is to calculate thermodynamic properties in terms of molecular properties. 
Given the structure of the individual molecules of our system and the form of the 
intermolecular potential, we wish to be able to calculate thermodynamic properties, 
such as entropy and free energy. We shall do this first with respect to mechanical 
properties (such as pressure, energy, volume), which are quantum mechanical or 
classical mechanical quantities, and then we shall bring nonmechanical thermodynamic 
variables (such as entropy, free energy) into our discussion by appealing to the equa- 
tions of thermodynamics. One useful distinction between mechanical and non- 
mechanical properties is that mechanical properties are defined without appealing 
to the concept of temperature, whereas the definitions of nonmechanical properties 
involve the temperature. 

Consider some macroscopic system of interest, such as a liter of water or a salt 
solution. From a macroscopic point of view, we can completely specify a system by a 
few parameters, say the volume, concentration or density, and temperature. Regard- 
less of the complexity of the system, it requires only a small number of parameters to 
describe it. From a microscopic point of view, on the other hand, there will be an 
enormous number of quantum states consistent with the fixed macroscopic properties. 
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We saw in Chapter 1 that the degeneracy of an isolated N-body system is of the order 
of 10" for all but the very lowest energies. This means that the liter of water or the salt 
solution could be in any one of the order of 10* possible quantum states. It would be 
impossible for us to ever determine which of the order of 10* possible states the 
system is in. The state of the system must be known, however, in order to calculate a 
mechanical thermodynamic property, such as the pressure, since the values of that 
property in each of the possible quantum states would, in general, be different. Thus 
we are faced with what appears to be an impossible task. 

It is at this point that we appeal to the work of Maxwell, Boltzmann, and particularly 
Gibbs. The modern (postquantum) version of their approach is that in order to cal- 
culate the value of any mechanical thermodynamic property (say, the pressure), one 
calculates the value of that mechanical property in each and every one of the quantum 
states that is consistent with the few parameters necessary to specify the system in a 
macroscopic sense. The average of these mechanical properties is then taken, giving 
each possible quantum state the same weight. We then postulate that this average 
mechanical property corresponds to a parallel thermodynamic property. For 
example, we postulate that the average energy corresponds to the thermodynamic 
energy and that the average pressure corresponds to the thermodynamic pressure. It 
turns out that the calculation of a mechanical property averaged over all the consistent 
quantum states can be readily performed. Before doing this, however, we shall intro- 
duce some concepts that will make this procedure clearer. 

We first discuss the concept of an ensemble of systems, first introduced by Gibbs. 
An ensemble is a (mental or virtual) collection of a very large number of systems, say 
sf, each constructed to be a replica on a thermodynamic (macroscopic) level of the 
particular thermodynamic system of interest. For example, suppose the system has a 
volume V , contains N molecules of a single component, and is known to have an 
energy E. That is, it is an isolated system with N, V, and E fixed. Then the ensemble 
would have a volume s#V, contain s#N molecules, and have a total energy & = s#E. 
Each of the systems in this ensemble is a quantum mechanical system of N interacting 
atoms or molecules in a container of volume V The values of TV and V, along with the 
force law between the molecules, are sufficient to determine the energy eigenvalues 
Ej of the Schrodinger equation along with their associated degeneracies Q(£ 7 ). These 
energies are the only energies available to the TV-body system. Hence the fixed energy 
E must be one of these E/s and, consequently, there is a degeneracy Q(£). Note that 
there are Q(E) different quantum states consistent with the only things we know about 
our macroscopic system of interest, namely, the values of TV, V , and E. Although all 
the systems in the ensemble are identical from a thermodynamic point of view, they 
are not necessarily identical on a molecular level. So far we have said nothing about 
the distribution of the members of the ensemble with respect to the Q(£) possible 
quantum states. 

We shall further restrict our ensemble to obey the principle of equal a priori prob- 
abilities. That is to say, we require that each and every one of the Q(£) quantum states 
is represented an equal number of times in the ensemble. Since we have no information 
to consider any one of the Q(E) quantum states to be more important than any other, 
we must treat each of them equally, that is, we must utilize the principle of equal 
a priori probabilities. All of the ft(£) quantum states are consistent with the given 
values of TV, V , and E, the only information we have about the system. Clearly, the 
number of systems in the ensemble must be an integral multiple of ft(£). The number 
of systems in an ensemble is a very large number and can be made arbitrarily large by 
simply doubling, tripling, and so on, the size of the ensemble. An alternative inter- 
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pretation of the principle of equal a priori probabilities is that an isolated system 
(N, F, and E fixed) is equally likely to be in any of its Q(£) possible quantum states. 

We now define an ensemble average of a mechanical property as the average value 
of this property over all the members of the ensemble, utilizing the principle of equal 
a priori probabilities. We postulate that the ensemble average of a mechanical property 
can be equated to its corresponding thermodynamic property. 

There are two complications in the above treatment that we should mention; 
neither of them, fortunately, is of any practical consequence. We have assumed that 
the isolated system that we have been using as an example has precisely the energy 
£. We know, however, from quantum mechanics that there always exists a small 
uncertainty A £ in the value of £. For all thermodynamic purposes, this complication 
is completely inconsequential, and we shall therefore ignore it. The explanation of the 
other complication involves a greater knowledge of quantum mechanics than is 
generally required in this book. We have assumed that the systems of the ensemble 
are in one of the Q(£) degenerate eigenstates having the eigenvalue £. The choice of 
these Q(E) eigenfunctions, however, is somewhat arbitrary since any linear combina- 
tion of these is also an eigenfunction with energy £. Moreover, a quantum mechanical 
system will, in general, not be in one of the Q(£) selected states, but will be some linear 
combination of them. Thus we have tacitly assumed that any system with N, F , and £ 
given will be a “ pure state,” whereas a system with N, F, and £ given will most likely 
be in a “ mixed state,” that is, in a state described by a linear combination of the pure 
states we have chosen. In any event, this complication need not be considered, since 
the results do not differ appreciably from those obtained from the simpler and more 
naive point of view which we have presented above and now adopt. 

Let us summarize this section by stating that we wish to calculate the ensemble 
average of some mechanical property, and then show that this can be set equal to the 
corresponding thermodynamic property. We have stated above that the calculation 
of the ensemble average is not difficult, and now we shall address ourselves to that 
problem. As Schrodinger says in his book:* “There is, essentially, only one problem 
in statistical thermodynamics, the distribution of a given amount of energy £ over 
identical systems. Or perhaps better, to determine the distribution of an assembly of 
identical systems over the possible states in which the system can find itself, given that 
the energy of the assembly is a constant £.” 

So far, in this section, we have focused our attention on an ensemble whose members 
have N , F, and £ fixed. This is called the microcanonical ensemble and is useful for 
theoretical discussions. For more practical applications, however, we consider not 
isolated systems, but those in which the temperature rather than the energy is fixed. 
The most commonly used ensemble in statistical thermodynamics is the canonical 
ensemble , in which the individual systems have N , F, and T fixed. The remainder of 
this chapter will deal with the canonical ensemble. There are many other types of 
ensembles, in fact, one for each set of thermodynamic variables that are used to specify 
an individual member of the ensemble. We shall discuss some of these other ensembles 
in the next chapter. 

2-2 METHOD OF THE MOST PROBABLE DISTRIBUTION 

Consider an experimental system with N> V, and T as its independent thermodynamic 
variables. We can mentally construct an ensemble of such systems in the following 
manner. We enclose each system in a container of volume F with walls that are heat 

* E. Schrodinger, Statistical Thermodynamics (Cambridge: Cambridge University Press, 1952). 
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conducting but impermeable to the passage of molecules. The entire ensemble of 
systems is then placed in a very large heat bath at temperature T. When equilibrium 
is reached, the entire ensemble is at a uniform temperature T. Since the containing 
walls of each system are heat conducting, each and every system of the ensemble has 
the same fixed values of N , V, and T. Now, the entire ensemble is surrounded by 
thermal insulation, thus making the ensemble itself an isolated system with volume 
sfV, number of molecules sfN, and some total energy g. (The actual value of & is 
not important.) Each of the st members of the canonical ensemble finds itself in a 
large heat bath at temperature T 

Because each of the systems of the canonical ensemble is not isolated but is at a 
fixed temperature, the energy of each system is not fixed at any set value. Thus we 
shall have to consider the entire spectrum of energy states for each member of the 
canonical ensemble. Let the energy eigenvalues of the quantum states of a system be 
E t (N, V) 9 E 2 (N , V ) . . . , ordered such that E J+l > Ej. It is important to understand 
here that any particular energy, say E t , is repeated according to its degeneracy, that 
is, occurs Cl(Ei) times. Any particular system might be found in any of these quantum 
states. We shall show later that the average energy or the probability that some system 
has a certain energy depends upon the temperature; however, any of the set of energies 
{Ej} is possible, and so must be considered. 

We can specify a state of the entire ensemble by saying that a u a 2 , a 3 , of the 
systems are in states 1, 2, 3, ... , respectively, with energies E u E 2 , E 3 , ... . Thus we 
can describe any one state of the ensemble by writing 

State No. 1, 2, 3, /... 

Energy E u E 2 , E 3 , . . . , E t . . . 

Occupation No. a u a 2 , a 3 , ..., a t ... 

Occupation Number means the number of systems of the ensemble in that particular 
state. The set of occupation numbers is called a distribution. We shall often denote 
the set {a-} by a. 

Of course, the occupation numbers satisfy the two conditions: 

I,aj = sf ( 2 - 1 ) 

J 

£ a jEj — & (2-2) 

J 

The first condition simply accounts for all the members of the ensemble, and the 
second represents the fact that the entire canonical ensemble is an isolated system, 
and hence has some fixed energy 

Since the canonical ensemble has been isolated from its surroundings by thermal 
insulation, we can apply the principle of equal a priori probabilities to this isolated 
system. In the form that we wish to use here, the principle of equal a priori probabili- 
ties says that every possible state of the canonical ensemble, that is, every distribution 
of occupation numbers a l9 a 2 , consistent with Eqs. (2-1) and (2-2) is equally 
probable and must be given equal weight in performing ensemble averages. 

The number of ways a) = W(a l9 a 2 , a 3 , . . .) that any particular distribution of 
the a- s can be realized is the number of ways that s£ distinguishable objects can be 
arranged into groups, such that a t are in the first group, a 2 in the second, and so on 
[see Eq. (1-77)]: 

x s4\ s4\ 


(2-3) 
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The systems are distinguishable since they are macroscopic systems, which we could, 
in principle, furnish with labels. 

In general, there are very many distributions which are consistent with Eqs. (2-1) 
and (2-2). In any particular distribution, ctj/s# is the fraction of systems or members 
of the canonical ensemble in the jth energy state (with energy £,-). The overall probabil- 
ity Pj that a system is in the jth quantum state is obtained by averaging a/stf over all 
the allowed distributions, giving equal weight to each one according to the principle 
of equal a priori probabilities. Thus Pj is given by 


J _ 1 la WifiafiL) 
s! W( a) 


In Eq. (2-4), the notation a/a) signifies that the value of a 5 depends upon the distribu- 
tion, and the summations are over all distributions that satisfy Eqs. (2-1) and (2-2). 
We shall later let oo, but the ratio a ystf will remain finite since dj -► oo as well. 

Given the probability that a system with fixed values of N , V, and T is in the jth 
quantum state, one can calculate the canonical ensemble average of any mechanical 
property from 

M =l M jPj (2-5) 

j 

where M } is the value of M in the jth quantum state. Thus the prescription for calculat- 
ing the ensemble average of any mechanical property is given by Eqs. (2-4) and (2-5) 
and is, in principle, complete. The summations involved in Eq. (2-4), however, are 
very difficult to perform mathematically, and thus in practice Eqs. (2-4) and (2-5) 
are too complicated to use. 

The fact that we can let s4 -► oo, however, allows us to appeal to the results of 
Section 1-5. We have seen there that multinomial coefficients, such as JT(a), are 
extremely peaked about their maximum value if all the variables a s are large. In 
Eqs. (2-1) through (2-4), each of the a/s can be made arbitrarily large since s# can 
be made arbitrarily large. Thus we can use an argument here very similar to that used 
in Section 1-5. We need make only one modification or extension. We have shown that 
W( a) is a maximum when all the a/s are equal, under the one constraint Eq. (2-1). We 
have now an additional constraint Eq. (2-2) on the a/s. So instead of peaking at the 
point at which all the a/s are equal, it will peak at some other set of a/s but the spread, 
nevertheless, will be arbitrarily small. We shall determine this set of a/s shortly. Let us 
denote this distribution by a* = {a/*}. 

The spread of W(sl) about its maximum value can be made arbitrarily narrow by 
taking the aj , that is, st, to be arbitrarily large. Thus the W( a) in Eq. (2-4) at any 
set of a/s other than the set a*, which maximizes JT(a), are completely negligible. We 
can replace the summations in Eq. (2-4) over all distributions by just one term, 
evaluated at a*. Thus we can write 




1 I. ffWo/a) 


E. ^(«) 


1 W(a*)dj* a/ 
It *F(a*) ~~ 


= ~~7 (lim aj -> oo) 


( 2 - 6 ) 


where a j* is the value of a 5 in that distribution that maximizes W( a), that is, the most 
probable distribution. The name of this section, the method of the most probable 
distribution, is derived from Eq. (2-6). Comparing Eqs. (2-6) with (2-4), we have 
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Thus, to calculate the probabilities to be used in ensemble averages, we need determine 
only that distribution a* that maximizes W( a) under the two constraints Eqs. (2-1) 
and (2-2). This is the problem to which we now turn. 

As this is a problem of maximizing a function of many variables with given con- 
straints on the variables, we have a direct application of Lagrange’s method of undeter- 
mined multipliers. Following Section 1-5, the set of a/s that maximizes W( a), subject 
to Eqs. (2-1) and (2-2), is found from 

{in *F(a) - a j= 1,2,... (2-8) 

where a and p are the undetermined multipliers. Using Eq. (2-3) for W( a) along with 
Stirling’s approximation (which is exact here since each of the a/s can be made 
arbitrarily large), one gets 


— In a* — a — 1 — pEj = 0 

j= 1,2,... 

(2-9) 

or 



a* = e~ a e~ fiEj 

.7=1,2,... 

(2-10) 


where a' = a + 1 . (See Problem 2-3.) This gives us the most probable distribution in 
terms of a and p. We now evaluate a' and p by using Eqs. (2-1) and (2-2) along with 
physical arguments. 


2-3 THE EVALUATION OF THE 

UNDETERMINED MULTIPLIERS, a AND p 

We can obtain an expression for a (or a') in terms of p by summing both sides of 
Eq. (2-10) over j and using Eq. (2-1) to get 


e*' = — 


Equation (2-7) thus becomes 

* - PEj(N,V ) 

p . — _L. — 

J S# y. e -PEjiN,V) 


( 2 - 11 ) 


( 2 - 12 ) 


Substituting this into Eq. (2-5), with Ej taken to be the mechanical property, gives 


E = E(N 9 V, P) = 


Ej(N 9 V) e -P E * N ’ V) 
j? je -pEAN,V) 


(2-13) 


According to the postulate of the ensemble method of Gibbs, this average energy 
E(N, V 9 p) corresponds to the thermodynamic energy E. 

The pressure is another important mechanical variable. When a system is in the 
state j\ dEj = — pj dV is the work done on the system when its volume is increased 
by dV (keeping the number of particles in the system fixed). Thus the pressure in the 
state j is given by 




(2-14) 
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The canonical ensemble average of p } is 


P = 'L Pj p j = 

j 



(2-15) 


We postulate that p corresponds to the thermodynamic pressure. 

The sum in the denominator of Eqs. (2-13) and (2-15) occurs throughout the equa- 
tions of the canonical ensemble. Let this be denoted by Q(N , V , fi): 

Q(N 9 V 9 p) = £ e - pE * N - v) (2-1 6) 

j 

We shall see that this function Q(N 9 V, p) is the central function of the canonical 
ensemble. 

We have made two connections with thermodynamics: 


P++P 

£<-►£ 


(ensemble postulate of Gibbs) 


Equation (2-13) gives £ as a function of /?. In principle, one could solve this equation 
for j? as a function of E 9 but in practice this is not feasible. Fortunately p turns out to 
be a more convenient quantity than E, so much so that it is preferable to have E as a 
function of p rather than the inverse. We shall now evaluate p in two different ways. 

We differentiate Eq. (2-13) with respect to V 9 keeping N and p fixed: 



— p + pEp — pEp 


In this equation. 


— £ jPjEje-^ 

and 








PEj 


. 1 j Pi 


PEj 


(2-17) 


Similarly, we can differentiate Eq. (2-15) to get 


dp 


= Ep — Ep 

N,V 


From Eqs. (2-17) and (2-18) we get 


(2-18) 



(2-19) 


Note that £ is a function of N 9 V 9 and p 9 whereas the E/s are functions of N and V 
only. This is an important distinction, that should be clearly and completely under- 
stood. 

Let us now compare Eq. (2-19) with the purely thermodynamic equation. (See 
Problem 1-29.) 



P 


( 2 - 20 ) 
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which we rewrite in terms of l/T instead of T : 

' <2 - 2,) 

A comparison of Eq. (2-19) with Eq. (2-21) allows us to deduce that p = const /T. 
It is customary to write p = 1 /kT, where k is a constant, whose value could possibly 
vary from substance to substance. We shall show now, however, that k has the same 
value for all substances, that is, A: is a universal constant. 

Consider two closed systems A and B , each having its own kind of particles and 
energy states, but in thermal contact with each other and immersed in a heat bath of 
temperature T. We now construct a canonical ensemble of systems AB (as shown in 
Fig. 2-1) representative of a thermodynamic AB system at temperature T and apply 
the method of the most probable distribution to the AB system. Let the number of 
molecules and volume of the A and B systems be N A , V A , and N B , V B , respectively, 
and let their energy states be denoted by {E JA } and {E jB }. If a } denotes the number of 
A systems in state Eja, and bj denotes the number of B systems in state Ej B , then the 
number of states of the AB ensemble with compound distribution {«_,-} and {bj} is 


fV(a, b) = 


s4\ m 

n^’rpri 


(2-22) 


where and 88 (s# = 88) are the number of A and B system, respectively. Equation 
(2-22) turns out to be a product of the separate A and B factors, because we can arrange 
the A systems over their possible quantum states independently of the B systems, and 
vice versa. The a/s and b/s must satisfy the three relations: 

I = -^ 

J 

'£b j = @ = st 
j 

Z ( a jE jA + bjEjg) = S (2-23) 

j 

We now apply the method of the most probable distribution to Eqs. (2-22) and 
(2-23) to get Problem 2-9 for the simultaneous probability that the AB system has its 
A part in the ith quantum state and its B part in the jth quantum state: 

g-fiEiA g-PEjB 

p, ‘~-q7-qT~ p ' aP " 

where 


Q A = Z e pEkA and Q B = £ e pEkB 

k k 


(2-25) 


n 
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B 


A 
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B 
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B 


A 

B 


A 

B 






Figure 2-1 . Canonical ensemble of composite AB systems. 
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Thus we have shown that two arbitrary systems in thermal contact have the same /?. 
But we have seen from Eqs. (2-19) and (2-21) that p = 1/kT, and so the two systems 
must have the same value of k. Since the nature of the two systems is completely 
arbitrary, k must have the same value for all systems. Thus fc is a universal constant 
and can therefore be evaluated using any convenient system. The most convenient is 
an ideal gas, and one can determine from the equation of state of an ideal gas 
[cf. Eq. (5-18)] that k = 1.3806 x 10“ 16 erg-deg~\ where the temperature is in units 
of degrees Kelvin. 

There is an alternative way to determine p which utilizes the fact that 1/T is an 
integrating factor of dq Tev . We shall present this argument here, since it will bring the 
nonmechanical property of entropy into our formalism. 

The argument based around Fig. 2-1 shows that if two systems are in thermal 
contact at equilibrium, they have the same value of /?. Since the two systems can be 
quite arbitrary, this implies that p must be some function of the temperature. We shall 
now show that /? dq rey is an exact differential. 

Consider the function /= In Q. We regard / as a function of p and all the E/s: 

f(P, E U E 2 = In {£ e (2-26) 

The total derivative / is 


df= 




(2-27) 


The partial derivatives occurring here are determined from Eq. (2-26) to be 




= -E 



= ~PP k 


Thus Eq. (2-27) becomes 


4 T“ -Edp-pYPjdEj 
j 

which can be written as 


<Hf+ PE) = p\dE - £ Pj dEjj (2-28) 

We now subject the ensemble of systems to the following physical process. We 
change the volume of all the systems by dV 9 changing, of course, the E/s for all 
of them alike in order to still have an ensemble of macroscopically identical systems. 
We also change the temperature of the ensemble by dT by coupling it with a large heat 
bath (of the same temperature), changing the temperature slightly and then isolating 
the ensemble from the heat bath. 

If initially there were a } systems of the ensemble in the energy state j with energy 
Ej, then a s dEj is the work done on all these systems in changing the energy from 
Ej to Ej + dEj . The total work done on the ensemble is £ dEj and £ jPjdEj 
is the ensemble average reversible work that we do on the systems. And since dE is 
the average energy increase, the term enclosed in parentheses on the right-hand side 
of Eq. (2-28) is the average reversible heat supplied to a system. Thus Eq. (2-28) is 

d(f+ PE) = p Sq rty (2-29) 
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which says that p Sq rty is the derivative of a state function, that is, that p is an integrating 
factor of Sq rey . One statement of the second law of thermodynamics says that p must 
be equal to constant/T, or \/kT. 

The left-hand side of Eq. (2-29), therefore, must be dS/k, and so we can write that 

E 

S' = — + k\nQ + constant (2-30) 

where the constant is independent of T and of the parameters (N, V, and so on) on 
which the E/s depend. Since thermodynamics deals with A S only, the constant will 
always drop out of any calculations of entropy changes for chemical and/or physical 
changes. We shall, therefore, set this constant to zero and discuss the implications of 
this at the end of the chapter. 

In the above argument that p was an integrating factor of 5q rty , we used the fact that 
the average work done on a system was Pj dEj . We do work, then, by changing the 
energies slightly, but keeping the population of these states fixed (the P/s do not 
change). A molecular interpretation of thermodynamic work, then, is a change in the 
quantum mechanical energy states of the system, keeping the population over them 
fixed. That a molecular interpretation of the absorption of heat is the inverse of this 
can be seen from 

<® = L Ej dPj + 1 Pj dEj 

j j 

= <5<7«v - <5fc>,ev 

Thus when a small quantity of heat is absorbed from the surroundings, the energy 
states of the system do not change (N and V are fixed), but the population of these 
states does. 


2-4 THERMODYNAMIC CONNECTION 


We now complete the connection between thermodynamics and the canonical 
ensemble. Equation (2-13) for E can be written as (see Problem 2-10): 

< 2 - 3 ” 

and we can also easily derive (see Problem 2-10) 


, (d In Q\ 

'’-"bid., < 2 - 32 > 

from Eq. (2-15). Equation (2-30) is an equation for the entropy S in terms of Q: 

S = kT (^S) (2-33) 

\ vT J NtV 


We have E, p , and S now as functions of Q, and so it is possible to derive expressions 
for all the thermodynamic functions in terms of Q. The function Q is the central 
statistical thermodynamic function of the canonical ensemble (N, V y and T fixed) and 
is called the canonical ( ensemble ) partition function : 

Q(N 9 V y T) = Y,e~ Emv)lkT 
J 


(2—34) 
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The partition function serves as a bridge between the quantum mechanical energy 
states of a macroscopic system and the thermodynamic properties of that system. If 
we can obtain Q as a function of TV, F, and T, we can calculate thermodynamic 
properties in terms of quantum mechanical and molecular parameters. Although the 
E/s are the energy states of an TV-body system and consequently appear to be unobtain- 
able in practice, we shall see that in a great many cases, we shall be able to reduce the 
TV-body problem to a one-body, two-body, three-body problem, and so on, or approxi- 
mate the system by classical mechanics. Both of these routes turn out to be very useful. 
For now, however, we need only assume that there is such a set of energies. 

We can derive an equation for the Helmholtz free energy A in terms of Q by using 
Eqs. (2-31) and (2-33) along with the fact that A = E — TS. The result is 

A(N, V y T)=-kT In Q(N, F, T) (2-35) 

Notice that of all the thermodynamic functions, it is A that is directly proportional to 
In Q(N, F, T), and that A is the thermodynamic potential whose natural independent 
variables are those of the canonical ensemble. Equation (2-35) can be considered to 
be the most important connection between thermodynamics and the canonical parti- 
tion function, since it is possible to derive many equations starting with its differential 
form (see Problem 2-11). Table 3-1 contains a summary of the formulas of the 
canonical ensemble. 

In this chapter, we have developed the connection between thermodynamics and 
the quantum mechanical states available to a macroscopic system characterized by 
TV, F , and T. This connection can be summarized by Eq. (2-35). Before concluding 
this chapter, we shall discuss the second and third laws of thermodynamics from a 
statistical thermodynamic point of view. A statement of the second law of thermo- 
dynamics for closed, isothermal systems is that AA < 0 for a spontaneous process. We 
wish to derive this inequality starting with Eq. (2-35). To do this, we first write 
Eq. (2-34) in a slightly different form 

Consider Eq. (2-34) for g(TV, F, T). The summation is over all the possible quantum 
states of the TV-body system. In carrying out the summation, a particular value of 
exp (—Ej/kT) will occur Q(£y) times, where Cl(Ej) is the degeneracy. Instead of listing 
exp (—Ej/kT) £l(Ej) times, we could simply write ft(£ J )exp( — Ej/kT\ and then sum 
over different values of E. If we do this, Eq. (2-34) is 

G(TV, F, T) = £ 0(1 V, F, E)e~ E(N ' V)/kT (2-36) 

E 

where we have dropped the no longer necessary j subscript of Ej . In Eq. (2-34), we 
sum over the states of the system. In Eq. (2-36) we sum over levels. Equation (2-36) 
is a more useful form for discussing the second law of thermodynamics. 

Consider a typical spontaneous processes, such as the expansion of a gas into a 
vacuum. Figure 2-2 shows the initial and final states of such a process. For simplicity, 
we consider the entire system to be isolated. Initially the gas might be confined to one 
half of the container. After removing the barrier, the gas occupies the entire container. 


o 
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Figure 2-2. The initial and final states of the expansion of a gas into a vacuum. 
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Equation (1-36) for Q(N, V 9 E ) of an ideal gas shows that the number of states is 
proportional to V N . For the process illustrated in Fig. 2-2, the gas goes from a thermo- 
dynamic state of energy E, number of particles N, and volume V/2 to one with the 
same energy E (the system is isolated), the same number of particles N, but with 
volume V Thus according to Eq. (1-36), the number of quantum states available or 
accessible to the system is increased. 

Another example of a spontaneous process is the following. Initially we have an 
isolated system containing a mixture of hydrogen and oxygen gases. Although hydrogen 
and oxygen react to form water, in the absence of a catalyst the reaction is so slow 
that we can ignore it. Since the rate of this reaction (uncatalyzed) is very slow com- 
pared to any thermodynamic measurement, we can consider the mixture of hydrogen 
and oxygen to be simply a mixture of two gases in equilibrium. If we now add a small 
amount of catalyst to the system, the hydrogen and oxygen will readily form water, so 
that the system contains hydrogen, oxygen, and water. Thus the addition of a small 
amount of catalyst makes all the energy states associated with water molecules available 
or accessible to the system, and, hence, the system proceeds spontaneously to populate 
these states. Since the originally accessible states are still accessible (there is still some 
hydrogen and oxygen in the system), this spontaneous process is associated with an 
increase in the number of states accessible to the system by the removal of some 
constraint. In this case, the constraint was a high activation energy barrier, which was 
removed by the addition of the catalyst. 

Both of the spontaneous processes that we have discussed occurred, because some 
restraint, inhibition, or barrier was removed which made additional quantum states 
accessible to the system. In general, any spontaneous process in an isolated system 
can be viewed in this manner. The removal of some constraint allows a greater number 
of quantum states to be accessible to the system, thus the ‘‘flow” of the system into 
these states is observed as a spontaneous process. 

The above discussion is limited to isolated systems. In order to discuss the condition 
A A < 0, we must now consider isothermal processes. When a system is in a heat bath 
rather than isolated, we must include all possible energy states or levels of the system. 
When a restraint is removed, the number of accessible quantum states of each and every 
energy E cannot decrease, and will usually increase, since the original states are still 
available. Thus we have that Q 2 (N, V, E) > V, E) for all E y where the subscripts 
1 and 2 denote the initial and final states, respectively. We now use this inequality 
along with Eq. (2-36) to show that A A < 0. Since no term can be negative and many 
are positive, we have 

Qi - Qi = Z {^(N, V, E) — Q t (N, V, E)}e~ E,kT > 0 (2-37) 

E 

In Eq. (2-37) we sum over all the levels available to the final state. It follows im- 
mediately from the inequality in Eq. (2-37) that 


AA =A 2 - A x = —kT In 



(2-38) 


for a spontaneous isothermal process, and thus we have written the second law of 
thermodynamics in terms of Eq. (2-35). 

Lastly, we consider the implications of putting the “constant” of Eq. (2-30) equal 
to zero. We shall see that this gives us a statistical thermodynamic version of the third 
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law of thermodynamics. If we write Eq. (2-33) for S more explicitly, we get 


S = k ln£e- £ ; /kr 

j 


+ 


1 YjEje~ E - kT 
T Yje-W T 


(2-39) 


We wish to study the behavior of this equation as T -> 0. Assume for generality that 
the first n states have the same energy (E t = E 2 = * • * = E n ) and that the next m states 
have the same energy (E n+i = E n+2 = • • • = E n+m ), and so on. Then in the limit of 
small T, Eq. (2-39) becomes (see Problem 2-19): 


S = k In n + — ~ E '* kT + -^(£„ +1 

n nT 


£ (£»i + i “ EOfkT 


and so 

lim S = k In n (2-40) 

r-*o 

Thus as T -> 0, 5 is proportional to the logarithm of the degeneracy of the lowest level. 
Unless n is very large, Eq. (2-40) says that S is practically zero. For example, if the 
system were a gas of TV- point particles, and the degeneracy of the lowest level were of 
the order of TV, A: In TV would be practically zero compared to a typical order of magni- 
tude of the entropy, namely, Nk. Thus setting the “constant” of Eq. (2-30) equal to 
zero is equivalent to adopting the convention that the entropy of most systems is 
zero at the absolute zero of temperature [cf. Eq. (1-44)]. 


ADDITIONAL READING 
General 

Andrews, F. C. 1963. Equilibrium statistical mechanics . New York: Wiley. Chapters 9-11. 

Hill, T. L. 1960. Statistical thermodynamics . Reading, Mass. : Addison- Wesley. Sections 1-1 through 
1-5. 

Kestin, J., and Dorfman, J. R. 1971. A course in statistical thermodynamics . New York: Academic. 

Sections 5-1 through 5-3 and 5-11 through 5-17. 

Kittel, C. 1969. Thermal physics . New York: Wiley. Chapters 1-4. 

Knuth, E. L. 1966. Statistical thermodynamics . New York: McGraw-Hill. Chapters 2, 4, and 5. 
Mayer, J. E., and Mayer, M. G. 1940. Statistical mechanics. New York: Wiley. Chapters 3, 4, and 10. 
Schrodinger, E. 1952. Statistical thermodynamics. Cambridge: Cambridge University Press. 
Chapters 1-3. 


PROBLEMS 

2-1. From statistical mechanics we have shown 



and from the thermodynamics we have 



Why can’t /3 be linearly proportional to the temperature? That is, = constant x T. 

2-2. To investigate the replacement of n 3 by nf, that is, the replacement of the average 
number of systems in state j by the most probable number in state j, consider the simple 
example in which H(n) is just a binomial distribution 


H(n) 


n\ 
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and actually calculate n i* and Hi. Hint: Recall that 


n 


d+*)"= 2 

«i = 0 


n\ 


— «i)l 


2-3. Show that Eq. (2-9) follows from Eq. (2-8). Note that in deriving this result, we have 
written In W( a) as In — s# — <*j In aj + and have considered s# to be a constant. 

Show that Eq. (2-10) is independent of this assumption, that is, derive Eq. (2-10) treating 
as 

2-4. Starting with Eq. (2-31), prove that the Boltzmann constant k must be positive, using 
the fact that the heat capacity C v is always positive. 

2-5. Show that the entropy can be written as 


S=-k^Pj\nPj 

j 

where Pj is given by Eq. (2-12). 

2-6. Maximize the function defined as “information” in information theory. 


/=2^1nPy 

j 

subject to the two constraints 
J 

and 

2EjPj = E = fixed 
j 

Compare this result to that of Problem 1-51. 

2-7. Obtain the most probable distribution of N molecules of an ideal gas contained in 
two equal and connected volumes at the same temperature by minimizing the Helmholtz free 
energy for the two systems. 

2-8. Differentiate Eq. (2-16) with respect to to derive Eq. (2-13). 

2-9. Derive Eq. (2-24). 

2-10. Derive Eqs. (2-31) and (2-32). 

2-11. Derive Eqs. (2-31) through (2-33) by starting with A = —kT In Q. 

2-12. We can derive Eq. (2-36) directly by the method of Lagrange multipliers. We label 
the levels rather than the states by a subscript /. The degeneracy of the /th level, whose energy 
is Ei, is n, . The number of ways of distributing systems over levels, with degeneracy O,, is 


s/\ _ 

^( a )=ff— ni^ 1 

Hi ai 

where a t is the number of systems in the /th level. Maximize this, subject to the constraints 
^a,=si 

l 

^aiEi—S 

i 


to get 

°'* = Qie~ E,lkT 

2-13. Show that for a particle confined to a cube of length a that 


2Ej 
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By taking the ensemble average of both sides, we have 

2 E 


If we use the fact that E = %NkT (to be proved in Chapter 5), we get the ideal gas equation of 
state. 

2-14. We shall show in Chapter 5 that the partition function of a monatomic ideal gas is 


Q{N,v,T)=- 


ImtikTY "' 2 


Derive expressions for the pressure and the energy from this partition function. Also show 
that the ideal gas equation of state is obtained if Q is of the form f(T) V N y where f(T) is any 
function of temperature. 

2-15. In Chapter 1 1 we shall approximate the partition function of a crystal by 


( p-hv/2kT \3N 

where hvjk = © £ is a constant characteristic of the crystal, and U 0 is the sublimation energy 
of the crystal. Calculate the heat capacity from this simple partition function and show that 
at high temperatures, one obtains the law of Dulong and Petit, namely, that C v -► 3 Nk as 
r+oo. 

2-16. In Chapter 13 of this author’s textbook Statistical Thermodynamics , it is shown that 
the partition function of an ideal gas of diatomic molecules in an external electric field & is 


Q(N y V y T y £) 


[g{V,T y g)Y 

N\ 


where 


q(V y T y #) = 


lirmkTX 3/2 /8t r 2 IkT\ e ~ hvl2kT 

k h 2 ) \ h 2 / (1 ~e-^ kT ) 



Here I is the moment of inertia of the molecule; v is its fundamental vibrational frequency; 
and fM is its dipole moment. Using this partition function along with the thermodynamic 
relation, 


dA = —SdT—p dV — MdS 

where M= NfL, where p, is the average dipole moment of a molecule in the direction of the 
external field S y show that 

p ' =t *[ coth {w)-^] 

Sketch this result versus from & = 0 to & = oo and interpret it. 

2-17. In Chapter 14 we shall derive an approximate partition function for a dense gas, 
which is of the form 


Q(N, V,T) = jj j 1 {V-Nbye" 12 ™ 

where a and b are constants that are given in terms of molecular parameters. Calculate the 
equation of state from this partition function. What equation of state is this? Calculate the 
thermodynamic energy and the heat capacity and compare it to Problem 1-30. 

2-18. From electrostatics, the displacement vector D is given by D = & 4- 4tt-P, where is 
the electric field, and P is the polarization, i.e., the total dipole moment M per unit volume. 
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The dielectric constant e is defined by D = e&. In the simple case of a parallel plate capacitor, 
D is the field produced by a set of charges on the plates, and so we can consider it to be 
external field; M is the total moment (both permanent and induced) of the substance between 
the plates; and is the field between the plates, that is, the force that an infinitesimal change 
would feel. If only a vacuum existed between the plates, D and & would be the same. A real 
substance modifies D such that D =& + 47t(M/ V), where V is the volume. Since D, and M 
are all in the same direction (at least for simple fluids), D must be ;><£, which says that 
1 - 

When an external electric field D is present, the first law of thermodynamics becomes 
dE=TdS-pdV-MdD + fxdN 

Problem: Describe concisely how one would calculate e (at least in principle) from statistical 
mechanics. 

2-19. Derive equation 2-40. 



CHAPTER 3 


OTHER ENSEMBLES 
AND FLUCTUATIONS 


In Chapter 2 we considered an ensemble in which N, V, and T are held fixed for each 
system. This ensemble is one of many possible ensembles that can be constructed. For 
example, if we allow the walls of the containers to be permeable to molecular trans- 
port, N is no longer fixed for each system, and we no longer have a canonical ensemble. 
The ensemble in this case is called a grand canonical ensemble and is discussed in 
Section 3-1. In Section 3-2 we discuss two other ensembles that are often used in 
statistical thermodynamics: the microcanonical ensemble, in which N , V, and E 
are fixed, and the isothermal-isobaric ensemble, in which IV, T, and p are fixed. The 
last section. Section 3-3, is devoted to an investigation of fluctuations in statistical 
thermodynamics. 

One of our basic assumptions is that the ensemble average of a mechanical property 
can be equated to the corresponding thermodynamic function; hence it is important 
that we investigate the expected spread about the mean value. We show in Section 3-3 
that for macroscopic systems the probability distribution of observing some mechan- 
ical property is a very narrow Gaussian distribution whose mean is the ensemble 
average. One important deduction from this result is that the various ensembles 
are essentially equivalent and that one can choose to work with a partition function 
on the basis of mathematical convenience rather than on the basis of which thermo- 
dynamic variables are used to specify the system of interest. 

3-1 GRAND CANONICAL ENSEMBLE 

In the previous chapter we treated the canonical ensemble, in which each system 
is enclosed in a container whose walls are heat conducting, but impermeable to the 
passage of molecules. The entire ensemble is placed in a heat bath at temperature T 
until equilibrium is reached, and then is isolated from its surroundings. Each system 
of the ensemble is specified by N, V, and T. In this section we shall treat a grand 
canonical ensemble. In a grand canonical ensemble, each system is enclosed in a 
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container whose walls are both heat conducting and permeable to the passage of 
molecules. The number of molecules in a system, therefore, can range over all possible 
values, that is, each system is open with respect to the transport of matter. We con- 
struct a grand canonical ensemble by placing a collection of such systems in a large 
heat bath at temperature T and a large reservoir of molecules. After equilibrium is 
reached, the entire ensemble is isolated from its surroundings. Since the entire ensemble 
is at equilibrium with respect to the transport of heat and matter, each system is 
specified by V, T, and fi, where ^ is the chemical potential. (If there is more than one 
component, the chemical potential of each component is the same from system to 
system.) Figure 3-1 shows a schematic picture of a grand canonical ensemble. 

We proceed now in the same manner as in the treatment of the canonical ensemble. 
In this case, however, we must specify a system not only by which quantum state it is 
in but also by the number of molecules in the system. For each value of N, there is a 
set of energy states {E Nj (V)}. We let a Nj be the number of systems in the ensemble that 
contain N molecules and are in the state j. Each value of N has a particular set of 
levels associated with it, so we first specify N and then j. The set of occupation numbers 
{a NJ } is a distribution. By the postulate of equal a priori probabilities, we assume that 
all states associated with all possible distributions are to be given equal weight or 
equal probability of occurrence in the ensemble. Each possible distribution must 
satisfy the following three conditions: 


TYa NJ = sf 

(3-1) 

N j 

X X °Nj EnJ ~ & 

(3-2) 

N j 

EE a Nj N = JT 

(3-3) 


N j 

The three symbols stf, and Jf denote the number of systems in the ensemble, the 
total energy of the ensemble (the ensemble is isolated), and the total number of 
molecules in the ensemble. 

For any possible distribution, the number of states is given by 

H ' <W) -irob 



Figure 3-1. A schematic picture of a grand canonical ensemble. Each system has a fixed volume and 
temperature, but is open with respect to molecular transport. 
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As in the treatment of the canonical ensemble, the distribution that maximizes W 
subject to the appropriate constraints completely dominates all others. Thus we 
maximize Eq. (3-4) under the constraints of Eqs. (3-1) through (3-3), respectively, and 
we get (see Problem 3-1) 

a Nj * — e~ a e~ pElfj{v) e~ yN (3-5) 

As before, the parameter a is easily determined in terms of the other parameter(s). 
We sum both sides of Eq. (3-5) over N and j and use Eq. (3-1) to get 

* * e- pE "* v 'e~ yN 

P Nj( V ’ P> ?) “ “ ££ e -fiE»j(V) e -yN ( 3_6) 

N j 

where P Nj (V, /?, y) is the probability that any randomly chosen system contains N 
molecules and be in the jth energy state, with energy E nj (V). 

The averages of the mechanical properties E, P, and N are 


E(V, /J,y) = i£I E N j(V)e-^^e-y" 

a N j 


W P, y) 



(3-7) 


(3-8) 


N(V, P, y) = ^I Ne- pEl) ' (v >e- vN 

~ N j 



where 


(3-9) 


H(K fi, y) = E I e-^WeT* (3-1 0) 

N j 

We now determine p and y. In our treatment of the canonical ensemble, one of the 
methods used to determine P was to derive an equation that related (dE/dV) NtP to 
(dp/dP) NtV and to compare this with a purely thermodynamic equation relating 
( dE/dV) NfT to (dp/dT) NtV [cf. Eqs. (2-17) to (2-21)]. This comparison suggested 
that P was proportional to 1 /T. We then showed that any two systems at the same 
temperature have the same value of /?, thus proving that p = 1/kT, where A: is a 
universal constant. We can do the same thing here (Problem 3-2), but it is not 
necessary. 

A grand canonical ensemble can be considered to be a collection of canonical 
ensembles in thermal equilibrium with each other but with all possible values of N. 
Each of the systems has the same value of /?, regardless of the number of molecules 
it contains. That P has the same value as in the canonical ensemble can be seen by 
imagining that we suddenly make the walls of the containers impermeable to the 
molecules but still heat conducting. This gives us a collection of canonical ensembles 
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with V, TV, and T fixed, and the arguments of Chapter 2 can be used to show that 
1 3 = \/fcT 

The value of y can be found by using the same method that we used in Chapter 2 
to show that p was an integrating factor of Sq rcy . Consider the function 


m { W)» = In H = In X I 

N j 

As the notation indicates, we regard /to be a function of /?, y, and the E N /s. The total 
derivative of / is 


df= (|) dp + if) d y+Il(- s f-) 

\dP/y,{E NJ ) \dyjp t {Enj) N j \dE N j) p t 


y, E NJt s 


dE N j 


Using Eqs. (3-7) through (3-10), we have 


df= -Edp-Ndy-pZZP N j dE *J 

N j 

The last term here is the ensemble average reversible work done by the systems. For 
simplicity, we assume only p-V work to get 


df = -Edp-N dy + ppdV 

Paralleling our development in Chapter 2, we add d(pE) + d(yN) to both sides of this 
equation: 

d(J+ PE + yN) = pdE + PpdV + ydN 
If we compare this to the purely thermodynamic equation 
TdS = dE+pdV- pdN 

and use the fact that P — 1/kT, we can conclude that 


y 

s 


-p 

kT 


E Np 

kin 

T T 


(3-11) 

(3-12) 


In Eq. (3-12), we have set the constant of integration equal to zero in accord with the 
third law of thermodynamics (see Problem 3-5). 

We have now brought the entropy, a nonmechanical property, into our discussion. 
Equation (3-12), along with Eqs. (3-6) through (3-9), allows us to express any thermo- 
dynamic function of interest in a grand canonical ensemble in terms of E(F, T, p). 
This function is called the grand ( canonical ensemble) partition function : 

B(K T, p) = Yl e- E ”* V) ' kT e' iN ' kT (3-1 3) 

N j 

As the canonical partition function is the connection between thermodynamics and 
statistical thermodynamics for closed, isothermal systems ( N , V, and T fixed), the 
grand partition function serves as the link for open, isothermal systems (V, T, and p 
fixed). If we can determine E for a system, we can calculate its thermodynamic 
properties. 

By summing over j for fixed N in Eq. (3-13), we see that it is possible to write 
E in the form 

E(V, T, /0 = E Q(N, V, TY lNfkT 

N 


(3-14) 
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The term e** lkT is often denoted by A. Since p = kT In A, A is an absolute activity, for 
the difference in chemical potentials between two states is given by A/z = kT ln(o 2 /fli), 
where a x and a 2 are activities. 

Since we take the number of systems in an ensemble to be arbitrarily large, the 
number of particles in an ensemble becomes arbitrarily large, and hence the possible 
number of particles in any one system can approach infinity. Therefore the summation 
in Eq. (3-14) can be taken from 0 to oo: 

B(V, T,n) = £ Q(N, V, T)X" (3-1 5) 

N = 0 

Even though it may appear from Eq. (3-15) that E would be more difficult to 
obtain than Q, it actually turns out in many problems that E is easier to obtain, since 
the constraint of constant N is often mathematically awkward. This constraint can 
be avoided by using a grand partition function, that is, by summing over all values of 
N (see Section 4-2). Furthermore, there are many systems in which the many-body 
problem can be reduced to a one-body, two-body problem, and so on. In these cases, 
the grand partition function is particularly useful. 

To complete our discussion of the grand canonical ensemble, we shall show that 
pV is the thermodynamic characteristic function of In E. To see this, compare 
Eq. (3-12) with the thermodynamic equation 


G = pN = E + pV - TS 
Thus we have 

pV = kT In E(F, T, p) (3-1 6) 

Problem 1-37 shows that pV is the thermodynamic function whose natural variables 
are V , T, and /z. Equations (3-8), (3-9), and (3-12) can be derived from Eq. (3-16) 
and the thermodynamic equation d(pV) = S dT + N dp + p dV (see Problem 3-6). 
Table 3-1 summarizes the formulas of the grand canonical ensemble. 


3-2 OTHER ENSEMBLES 

We could go on to consider other ensembles. For example, we could construct an 
ensemble of systems in which the containing walls of each system are heat conducting 
and flexible, so that each system of the ensemble is described by N, T, and p. The 
constraints would be on the total energy and total volume of the ensemble, and the 
partition function would turn out to be (see Problem 3-9) : 

A(N, T, p) = £ £ Q(N, V, E)e- (3-1 7) 

E V 

whose characteristic thermodynamic function is the Gibbs free energy, that is, 

G = - kT In A (N, T, p) (3-1 8) 

Equation (3-17) is called the isothermal-isobaric partition function. Notice that the 
natural variables of G are N, T, and p, the variables associated with this ensemble. 

If we compare Eq. (3-17) with the two other partition functions that we have 
derived [Eqs. (2-34) and (3-13)], we see that all three can be obtained by starting with 
Q(N, V , E), multiplying by some appropriate exponential, and summing over one or 
two of the variables JV, V, and E. In a sense, Q(N, V, E) is fundamental to all 
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ensembles and, in fact, is itself the partition function for conceptually the most simple 
ensemble, the one representative of isolated systems. This is called the microcanonical 
ensemble. 

We can apply the results of the previous section to a treatment of an isolated system. 
The grand canonical ensemble represents a collection of systems whose containing 
walls allow heat and molecules to pass freely from one system to another. From a 
physical point of view, the entire grand canonical ensemble is equivalent to one 
isolated system of volume siV, containing Jf molecules and having energy <f. The 
partitions in Fig. 3-1 can be considered to be a conceptual division of one isolated 
system into si subsystems. The entropy of the entire ensemble S e , considered as one 
isolated system, is siS, where S is the entropy of each of the open, isothermal systems. 
This entropy is given by Eq. (3-12): 

S = k(J$E + yN + In E) (3-19) 


where we use the notation /? and y for convenience. We use Eqs. (3-7) and (3-9) for 
E and TV: 


o , , - e~ pE ^e~ vN _ Ne- pE ”'e~ yN 

S — k In c. 4- ky£pE NJ — 1- 5] y 


= k In E + k X (PE Nj + yN) 


e -P E »J e 'V N 


) 




= k In E — k £ (In a N * + In E — In si) 

NJ & 


(3-20) 


where we have used Eq. (3-6) to write the last line. We can perform the summation 
over the second two terms in parentheses in Eq. (3-20) : 


S = k In E - — £ a NJ * In a NJ * 
& N,J 


k In S + k In si 


or 


S e = siS = ksi In si — kY, a*!* In a Nj * 

n.j 

= k\nW({a Nj *}) (3-21) 

We see that for an isolated system, the entropy is proportional to the logarithm of the 
number of states available to the system. In another notation, we can write 

S = k\nQ(N , V,E) (3-22) 

Equation (3-22) shows that the more states there are available to an isolated system, 
the higher is its entropy. This equation serves as the basis for qualitative statements 
concerning entropy and disorder, randomness, and so on. In practice, Eq. (3-22) is 
not used for the calculation of thermodynamic functions since TV, V, and E are all 
mechanical variables. 

The argument leading to Eq. (2-38) incidentally can be immediately applied to 
Eq. (3-22). For any spontaneous process in an isolated system, 

AS = k In > 0 

fti 

where 1 and 2 represent the initial and final states, respectively. 
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Equation (3-22) is due to Boltzmann and is possibly the best-known equation in 
statistical thermodynamics, mainly for historical reasons. Of course, Boltzmann 
(1844-1906) did not express his famous equation in terms of quantum states, but 
rather in a classical mechanical framework. We shall take up classical statistical 
mechanics in Chapter 7. Boltzmann, in fact, was a great contributor to both equilib- 
rium and nonequilibrium statistical mechanics. He was one of the first to see clearly 
how probability ideas could be combined with mechanics. Equation (3-22) is carved 
on his tombstone in the Zentralfriedhoff in Vienna, although the equation is not often 
used today. However, his contribution to nonequilibrium statistical mechanics is such 
that to this day the so-called Boltzmann equation (Chapters 18 and 19) still is the 
fundamental equation describing the transport of dilute gases. It is interesting to note 
that Boltzmann, who contributed so much to understanding macroscopic phenomena 
in terms of molecular mechanics, lived at a time when the atomic theory was not so 
generally accepted as it is today, and his work was severely criticized by some of the 
leading physicists of the day. He committed suicide in 1906 (for reasons not entirely 
clear) and never lived to see the full acceptance of his work in statistical mechanics. 

Although ft(N, V, E) is not generally available, we have determined it for an ideal 
gas in Section 1-3 [cf. Eq. (1-37)]. If we calculate k In ft, neglecting terms of order 
less than O (N 1 ), we get (see Problem 3-11): 

, , [f2nmkT\ 3/2 Ve 5l2 l 

s - w *4(-H — J (3 - 23) 


We shall see later that this equation gives excellent agreement with experiment, but 
now we simply show that if we use 


dS = l,dE+£dV-^dN 
T T T 


(3-24) 


to get 

P 

T 


a 

dV. 


N, E 


and substitute Eq. (3-23) into this, we find 


pV = NkT 


which is the ideal gas equation of state. See Table 3-1 for a summary of the formulas 
related to the microcanonical ensemble. 

It is possible to derive partition functions appropriate to other sets of independent 
variables, but the four that we have considered above are sufficient for most applica- 
tions. We shall show that in the limit of large systems in equilibrium, one can choose 
an ensemble and its partition function on the basis of mathematical convenience 
rather than on the basis of which thermodynamic variables are used to describe the 
system. This result will come out of a study of fluctuations, which we turn to now. 


3-3 FLUCTUATIONS 

The methods that we have developed allow us to calculate ensemble averages of 
mechanical variables, which we then equate to thermodynamic functions. Equations 
such as Eq. (2-12) or (3-6) are the probability distributions over which these ensemble 
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Table 3-1 . A summary of formulas for several types of ensemble 


microcanonical ensemble, n(7V, V, E ) 
S — k In Cl 

dS = j,dE + j,dV—j,dN 


1 

In H\ 

kr~ 

V SE ) N ,y 

p _ , 

/01n fi\ 

kT 

y W ) n .e 

h 

/81n£l\ 

kT 

\ dN ) Vi 



isothermal-isobaric ensemble, A {N, T, p) 
G— —kT\n A 
dG=-SdT+ Vdp + pdN 



(3-25) 

(3-26) 

(3-27) 

(3-28) 

(3-29) 

(3-30) 

(3-31) 

(3-32) 

(3-33) 

(3-34) 

(3-35) 

(3-36) 


(3-37) 
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averages are taken. In Section 1-5 we saw that the average is the first of a family of 
moments. Another important moment is the second central moment or the variance, 
(x — x) 2 , which is a measure of the spread of a probability distribution about the mean 
value. Furthermore, we saw toward the end of Section 1-5 that the most meaningful 
measure of the spread of a distribution is the square root of the variance, that is, the 
standard deviation, relative to the mean value. A standard deviation of 10 10 may be 
large as an absolute number, but it is extremely small if the mean of the probability 
distribution is 10 20 . In this section we shall calculate the variances of several mechanical 
variables and compare these to the mean values. 

Any deviation of a mechanical variable from its mean value is called a fluctuation, 
and the investigation of the probability of such deviations is called fluctuation theory. 
Fluctuation theory is important in statistical mechanics for a number of reasons. The 
most obvious reason is to determine to what extent we expect to observe deviations 
from the mean values that we calculate. If the spread about these is large, then experi- 
mentally we would observe a range of values, whose mean or average is given by 
statistical thermodynamics. We shall see, however, that the probability of observing 
any value other than the mean value is extremely remote. As a corollary to this im- 
portant result, we shall see that all of the ensembles that we have considered earlier 
are equivalent for all practical purposes. In addition, there are several statistical thermo- 
dynamical theories of solutions and light scattering based on fluctuation theory, and 
one formulation of the statistical mechanical theory of transport focuses on the rate 
of decay of spontaneous fluctuations. 

Let us consider first fluctuations in a canonical ensemble. In a canonical ensemble, 
N, V , and T are held fixed, and we can investigate fluctuations in the energy, pressure, 
and related properties since these are the ones that vary from system to system. It is 
important to be aware of the properties that can vary and those properties that are 
fixed in each ensemble. We shall consider fluctuations in the energy. Thus we use 
Eq. (2-12) for the probability distribution of the energy and write for the variance 


= (£- jE ) 2 = £ 2 -£ 2 


r2 


“I E/Pj-E 


where 


,-PEj 


Pj Q(N, v, p) 


(3-38) 


(3-39) 


We can write Eq. (3-38) in a more convenient form by noting that 
£ £ 2p _ L £ E 2 e~ pEj = — — — Eje' pEj 

Qdp dp dp 

= feT 2 + E 2 

dT 


(3-40) 


Thus Eq. (3-38) becomes 



(3-41) 
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and if we associate £ with the thermodynamic energy, we have 
o E 2 = kT 2 C v 

where C v is the molar heat capacity. 

To explore the relative magnitude of this spread, we look at 

<r £ 0 kT 2 C v ) 1/2 
E E 

To get an order-of-magnitude estimate of this ratio, we use the values of E and C v 
for an ideal gas, namely, 0(NkT) and O(Nk), respectively. If we use these values in 
Eq. (3-43), we find that o E jE is O (N~ 1/2 \ showing that in a typical macroscopic 
system, the relative deviations from the mean are extremely small. The probability 
distribution of the energy may, therefore, be regarded as a Gaussian distribution which 
is practically a delta function. 

We can derive a Gaussian distribution approximation to P(E ), the probability of 
observing a particular value of £ in a canonical ensemble. According to Eq. (2-36), 
jp(£) is given by CQ(E)e~ E/kT , where C is a normalization factor which is independent 
of £. Since Q(£) is an increasing function of £, and e~ E,kT is a decreasing function of 
£, their product £(£) peaks at some value of £, say £*. But we have just seen above 
that the spread about the maximum value is extremely small, and so £* and £ are 
essentially the same point. The width of P(E) is 0(N _1/2 ), and so £* and £ differ by 
0(iV” 1/2 ). 

Let us now expand £(£) in a Taylor series about £*, or £. As in Section 1-5, it is 
more convenient to work with In £(£). From the definition of £*(« £) as the value 
of £ at the maximum in P(£), 


(3-42) 


(3-43) 


(^) 

\ OE /£ = £•=£ 



(3-44) 


Equation (3-44) determines £ as a function of /?. The second derivative of In P{E) is 
/a 2 in p\ /a 2 in n\ 


which is to be evaluated at E = E* = E. Since 


/a 2 In fi\ a 2 lnfi(£) a /ain£2(£)\ 
\ SE 2 / £=e _ dE 2 ~8E\ dE ) 

_ a /ainfi\ dp 
~dE\ dE /£= E _ a£ 


where the last term follows from Eq. (3-44), we have 
/a 2 In P\ dp 1 dT 1 

l dE 2 ) E=E , =E ~dE~ ~kf 2 ~dE kT 2 C v 

The Taylor expansion of In P(£) through quadratic terms is 

lniW-l„P(£)-§g|£ + - 
or 

P(£,.P(£^p{-|^} 


(3-45) 


(3-46) 


(3-47) 
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Problem 3-16 involves showing that terms beyond the quadratic terms can be ignored 
in Eq. (3-46). 

If we compare Eq. (3-47) to the standard form of a Gaussian distribution [Eq. (1-72)], 
we see that a E = kT 2 C v (in agreement with Eq. (3-42)] and that the normalization 
constant P(E) is (2n<r E 2 )~ 12 . Equation (3-47) can be used to calculate the probability 
of observing a value of E that differs from E. For example, the probability of observing 
an energy that differs by 0. 1 percent from the average energy of 1 mole of an ideal 
gas 0(e -1 ° 6 ), an extremely small number. (See Problem 3-12.) 

Incidentally, the derivation of Eq. (3-47) is a case where we must be careful not to 
confuse the variable E in P(E) with £*, £, or the thermodynamic quantity E, which 
unfortunately is also called “ £.” This is especially true of Eq. (3-44), where £ is a 
variable, and £ is that particular value of the variable for which the quantity d In fi/3£, 
a function of £, is equal to the preassigned value of /? (N, V, T are given in a canonical 
ensemble). 

We could also calculate the fluctuations in the pressure in a canonical ensemble, but 
this is left to Problem 3-18. Instead, we consider the fluctuations in a grand canonical 
ensemble. In a grand canonical ensemble, V, T, and fi are held fixed, while the energy 
and number of particles in each system are allowed to vary. We can calculate the 
fluctuation in the number of particles in the same manner as we treated the fluctuation 
in energy in a canonical ensemble. If c N 2 is the variance in the number of particles, then 

o N 2 =N* -N 2 = Y N 2 P nj - N 2 ( 3 - 48 ) 

N,j 

where 


P Nj ~ 


e ~PE„j e -yN 

£(F, /?, y) 


We treat N 2 in analogy to Eq. (3-40): 


£ N>p = I £ = _ i 1 £ Ne -n K , e -yn 

N.j “ n.j ^oyn.j 


li. 

ady 


dN dlnz 

<N=, --^- N -3r 


-kT 



V „ T 


+ N 2 


Thus Eq. (3-48) becomes 



( 3 - 49 ) 


( 3 - 50 ) 


The right-hand side of this equation can be written in a more familiar form by ther- 
modynamic manipulations. Problem 3-26 proves that 

= _ v^_ (8p\ 

\dN) V ' T N 2 \dv) N ' T 

and so 


N 


N 2 kT K 


V 


( 3 - 51 ) 
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where k is the isothermal compressibility 




V 



The value of a N relative to N is 
r kTfc\ m 


°n _ (IcTkY 
N \ V ) 


(3-52) 


(3-53) 


To get an order-of-magnitude estimate of this ratio, we use the fact that k = 1/p for 
an ideal gas to get c n /N = N~ 1/2 . Again we find that relative deviations from the 
mean are very small. The result, 0(N _1/2 ), is typical of fluctuations in statistical 
thermodynamics. 

Since V is fixed in the grand canonical ensemble, the fluctuation in the number of 
particles is proportional to the fluctuation in the density p, and so 

a c N (kTK\ in 

7 =¥=(—) (3 ~ 54) 


There is a condition under which the fluctuations in density are not negligible. At the 
critical point of a substance, (dpldV) NtT is zero, and hence its isothermal compress- 
ibility is infinite. Thus there are large fluctuations in the density from point to point 
in a fluid at its critical point. This is observed macroscopically by the phenomenon 
of critical opalescence, in which a pure substance becomes turbid at its critical point. 

We can also derive a Gaussian approximation to P(N). Let N*(= N) be the value 
of N at the peak in P(N). We have 


P(N ) = CQ(N, F, T)e fitlN 


where C is a normalization constant. Then 



This equation determines N as a function of /?p. Also, 

/ d 2 In P\ Id 2 In 6\ S d In Q(N , V, T ) 

\ SN 2 ) N=f) ~ \ SN 2 ] w= * “ 8N 8N 


1 

“ ~ kT(8N!dn)v. t 

Thus we find 


8pn 

8N 


^=^[ 2 mmt-} < 3 - 66 » 

which gives the same expression for c N 2 as Eq. (3-50). Problems 3-19 through 3-20 
involve the determination of fluctuations in the isothermal-isobaric ensemble. 

An interesting application of the above fluctuation formulas is to the scattering of 
light by the atmosphere. It can be shown that if light of intensity / 0 is incident on a 
region of volume V with a dielectric constant e, which differs from the average value 
of c for the medium £, the intensity of light scattered at an angle 6 at a distance R is 


1(6) n 2 V 2 o e 2 (\ +cos 2 0) 

7 0 “ 2 A 4 R 2 


(3-56) 


where o 2 is the variance of £, and X is the wavelength of the incident light in vacuum. 
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This is called Rayleigh scattering. The dielectric constant e is related to the density 
by the so-called Clausius-Mossotti equation 


£ — 1 
£ + 2 


= Ap 


(3-57) 


which is derived and discussed in most physical chemistry texts. The quantity A is a 
constant, and p is the density. We can see from this equation that fluctuations in p 
lead to fluctuations in £, and hence to Rayleigh scattering by Eq. (3-56). If we calcu- 
late o 2 in terms of a 2 from Eq. (3-57), and use Eq. (3-54) for o 2 , we find (see 
Problem 3-21) 


m 

h 


n 2 kT 

IsF 


k(g - 1) 2 (£ + 2) 2 F 


(1 + cos 2 0) 
R 2 


where k is the isothermal compressibility. By integrating this over the surface of a 
sphere of radius R , we obtain finally 


f R 2 sin 6 d6 d<f> 
Iq A) 

0-3 


This equation shows that the blue color of the sky is due to fluctuations in the density 
of the atmosphere. The 2 4 in the denominator gives rise to a strong dependence on 
wavelength, so that the short wavelengths (blue) of the sun’s light are scattered more 
than the red, and hence the sky appears blue. Similarly, red sunsets and sunrises are 
due to the fact that the long wavelengths (red) are not scattered as much as the blue. 

There is one result of fluctuation theory which will be very useful to us. We have 
stated above that the various ensembles and their partition functions are essentially 
equivalent to each other, and that one can choose to work with a partition function 
on the basis of mathematical convenience. We now show why this is so. 

Consider the canonical partition function: 

Q(N, V,T) = Y 0(N, V, E)e~ E kT (3-58) 

E 

We have seen in Eq. (3-47) that P(E) = Cfi(£)exp( —EjkT) is an extremely narrow 
Gaussian function of E. In the limit of large N (and it is only in the limit of large N 
that classical thermodynamics is valid), only one value of E is important, namely, 
£ = £* = £. Thus in the summation in Eq. (3-58), only the term with E = E contri- 
butes, and Eq. (3-58) becomes 

Q(N , V, T) = Q (N, V , E)e-* lhT (3-59) 

Although the systems of a canonical ensemble can, in principle, assume any value of 
E (as long as it is an eigenvalue of the N-particle Schrodinger equation), it happens 
that the energy of the entire ensemble is distributed uniformly throughout the ensemble, 
and each system is almost certain to be found with the average energy E. A canonical 
ensemble degenerates, in a sense, to a microcanonical ensemble. 

If we take the logarithm of Eq. (3-59) and use Eq. (2-35), we find that 

A = E - kT In n(N, V, E) 
or that 

S = klnSl(N , F, E) 
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This is an alternative derivation of the fundamental relation between the entropy and 
the number of states accessible to the system. 

The general results we have obtained here are also obtained for other ensembles. 
For example, although the systems of a grand canonical ensemble can assume any 
value of N and E, in practice it turns out that the total energy and the total number of 
molecules of the entire ensemble are distributed uniformly throughout the ensemble, 
and each system has the average energy and contains the average number of molecules. 
This, of course, is exactly what one expects intuitively, as long as the systems are of 
macroscopic size and the density is not extremely low. 

These results can be used to write down, by inspection, the characteristic thermo- 
dynamic function of any partition function. Equations (3-58) and (3-59) are a good 
example. Suppose we did not know that A = — kT In Q. We do know that S = k In 12, 
however, and so if we take the logarithm of Eq. (3-59), we get that In Q = S/k — /?£, 
which shows that A — — kT In Q. Since partition functions, in general, are a sum of 
Q (TV, V, E) multiplied by exponential factors, this method can always be used to 
determine the thermodynamic characteristic function. (See Problem 3-15.) 
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PROBLEMS 
3-1. Derive Eq. (3-5). 

3-2. Using a grand canonical formalism, show that any two systems at the same tempera- 
ture have the same value of jS. 

3-3. For a grand canonical ensemble show that 



Compare this to the thermodynamic equation (see Problem 1-31) 



to suggest that = const /T for a grand canonical ensemble. 
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3-4. State and use Euler’s theorem to show 


[d In E\ In S 

p = kT (-w-)„.r kT T- 


3-5. Show that the entropy given by Eq. (3-12) goes to zero as T goes to zero. 

3-6. Derive the principal thermodynamic connection formulas of the grand canonical 
ensemble starting from 


pV—kT In S 


and 


d(pV ) - SdT+ Ndi ll + p dV 
3-7. Show that for a two-component system 


20xi, in , T, V) = £ £ Q{N U N 2 , V, 

N i N 2 

where A, = e UilkT (i = 1, 2). From this derive the corresponding thermodynamic connection 
formulas. 

3-8. In the next chapter we shall see that the grand partition function of an ideal mon- 
atomic gas is 


where q = ^hrmkT h 2 ) 3/2 V. Derive the thermodynamic properties of an ideal monatomic gas 
from E. 

3-9. Show that the partition function appropriate to an isothermal-isobaric ensemble is 
A(N, p, f) = EI D(AT, K E)e- E * kT e~ pVtkT 

E V 

Derive the principal thermodynamic connection formulas for this ensemble. 

3-10. In Problem 5-17 we shall show that the isothermal-isobaric partition function of an 
ideal monatomic gas is 

a U27Tm) 3 ' 2 (kT) 5 ‘ 2 Y 

L Ph* J 

Derive the thermodynamic properties of an ideal monatomic gas from A. 

3-11. Derive Eq. (3-23) starting from Eq. (1-37). 

3-12. Calculate the probability of observing an energy that differs by 10" 4 percent from 
the average energy of 1 mole of an ideal gas. 

3-13. Show that for macroscopic ideal systems, ones obtains the same result for the 
entropy whether one uses S ~ k In O(E), where O is the number of quantum states with energy 
<E [Eq. (1-36)], or S= k In H(£, AE), where H(E, AE) is the number of quantum states 
within energy A E about E [Eq. (1-37)] as long as A E, E is small, but not zero. Show that S is 
insensitive to A E over a wide range of AE. The next problem discusses this remarkable result 
more generally. 

3-14. Let H(E) dE be the number of quantum states between E and E + dE. In Chapter 1 
we showed that H(E) is a monotonically increasing function of E (at least for an ideal gas). 
We can write two obvious inequalities for H(E) : 

0(E) = f n(E') dE' ^ H(E) AE 
J o 

EH(E) ^ f H(E') dE' - <X>(E) 

J o 
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where A£ is a small region surrounding E. By multiplying the second inequality by &E/E, we 
get 

t±E r E 

n(£)A E>—\ n (E')dE' 

t, Jq 


Combining this inequality with the first one above gives 


A£ 

~E 


f n (E f ) dE' < n (E) le < f n(E') 
J o J o 


dE' 


Taking logarithms gives 

In «<£) - In ^ In [£!(£) A £] < In <KE) 

Now unless A£ is extremely small. In (ElAE) is completely negligible compared to In $(£), 
since the total number of states with energies equal to or less than E is at least 0(e N ). Show 
that even if the energy could be measured to a millionth of a percent. In ( E/AE ) « 18, which 
is completefy negligible compared to N. 

3-15. Fluctuation theory provides a simple method to determine the characteristic function 
associated with a particular partition function. Consider the canonical partition function 

Q(N, V,T) = 2 n (N, V, E) e-*>" 

E 

According to the theory of fluctuations, there is effectively only one term in this summation, 
and so we write 

Q(N, V,T) = G{N, V, E) e~ E,kT 


Remembering that S = k In H, we have, upon taking logarithms, that 



E 

icT 


or that 



Proceeding in a like manner, determine the characteristic thermodynamic function of the 
following partition functions: 

N 

Hp,T,m = 2 Q(N, V, T) e-<” v 
v 

<KV, E, /3m) = 2 W, V, E) e»* 

N 

nv, T, Ml, N 2 ) =2 Q(N 1, N 2 , T, V)^' 

N i 

W(p, y,T,N) = 22 &.N, V, si, T) e-» r e** 

V st 

where is surface area, and y is the surface tension. 

3-16. When we derived the Gaussian expression for P(E) in a canonical ensemble, we 
expanded In P(E) in a Taylor expansion about E= E* & E, dropping terms after the quadratic 
term. Show that these terms are negligible. 
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3-17. Show that 

(E-Ey = + 2 r*c„J 

and that 


C E-E Y 
E 3 


-0(AT“ 2 ) 


for a canonical ensemble. 

3-18. Derive an expression for the fluctuation in the pressure in a canonical ensemble. 
3-19. Show that for an isothermal-isobaric ensemble 


P(V)~ P(V*) exp 


{ ( v-vy \ 



3-20. Derive an equation for the fluctuation in the volume in an isothermal-isobaric 
ensemble. In other words, derive an equation for V 2 — V 2 . Express your answer in terms of 
the isothermal compressibility, defined by 



Show that o v /V is of the order of N~ 112 . 

3-21. By calculating o e 2 in terms of o p 2 from Eq. (3-57) and using Eq. (3-54) for cr p 2 , show 
that 


1(6) 7 r 2 kT (1 4- cos 2 6) 

3-22. Show that the fluctuation in energy in a grand canonical ensemble is 

3-23. Show that in a two-component open, isothermal ensemble that 


idNA 

NiN 2 — NiN 2 — kT\ — — 1 

\CfJ,2/ V, T.m 

fdN 2 \ 

= kT \ir) 

\ J V, T. m 2 


3-24. Show that 
IF- H 2 = kT 2 C p 
in an N, p, T ensemble. 

3-25. Use the formulas in Table 3-1 to derive expressions for any other thermodynamic 
functions for each of the four ensembles listed there. 

3-26. Show that 
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BOLTZMANN STATISTICS, 
FERMI-DIRAC STATISTICS, 
AND BOSE-EINSTEIN 
STATISTICS 


The results that we have derived up to now are valid for macroscopic systems. In order 
to apply these equations, it is necessary to have the set of eigenvalues {£)(N, V)} of 
the N-body Schrodinger equation. In general, this is an impossible task. There are 
many important systems, however, in which the N-body Hamiltonian operator can 
be written as a sum of independent individual Hamiltonians. In such cases the total 
energy of the system can be written as a sum of individual energies. This leads to a 
great simplification of the partition function, and allows us to apply the results with 
relative ease. 

We shall see that the final equations depend upon whether the individual particles 
of the system are fermions (that is, the N-body wave function is antisymmetric under 
the interchange of identical particles) or bosons (the N-body wave function is sym- 
metric under the interchange of identical particles). These two types of particles obey 
different laws, called Fermi-Dirac or Bose-Einstein statistics. We shall show that under 
normal conditions (for example, sufficiently high temperatures), both of these distribu- 
tion laws can be approximately reduced to an even simpler one, called Boltzmann 
statistics. The Boltzmann distribution law can also be derived from Q(N, V, T) at 
high temperature without first deriving the Fermi-Dirac and Bose-Einstein distribution 
laws, and this is done in Section 4-1. We shall discuss in this section just what is meant 
by “normal” conditions or “sufficiently high” temperatures. Then in Section 4-2 
we derive the two fundamental distribution laws, Fermi-Dirac and Bose-Einstein 
statistics, and show how both of them reduce to Boltzmann statistics in the appropriate 
limit. 

4-1 THE SPECIAL CASE OF BOLTZMANN STATISTICS 

In Section 1-3 it was shown that if the Hamiltonian of a many-body system can be 
written as a sum of one-body Hamiltonians, the energy of the system is the sum of 
individual energies, and the wave function is a product of the single-particle wave 
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functions. In addition, the wave functions of a system of identical particles must 
satisfy certain symmetry requirements with respect to the interchange of the particles. 
All known particles fall into two classes: those whose wave function must be symmetric 
under the operation of the interchange of two identical particles, and those whose 
wave function must be antisymmetric under such an exchange. Particles belonging to 
the first class are called bosons, and the others are called fermions. There is no restric- 
tion of the distribution of bosons over their available energy states, but fermions have 
the very severe restriction that no two identical fermions can occupy the same single- 
particle energy state. This restriction follows immediately from the requirement that 
the wave function be antisymmetric (see Problem 1-26). These considerations become 
important in enumerating the many-body energy states available to the system. 

There are many problems in which the Hamiltonian can be written as a sum of 
simpler Hamiltonians. The most obvious example perhaps is the case of a dilute gas, 
where the molecules are on the average far apart, and hence their intermolecular 
interactions can be neglected. Another example, which may be familiar from physical 
chemistry, is the decomposition of the Hamiltonian of a polyatomic molecule into its 
various degrees of freedom: 


translational 


rotational 


vibrational 


+ *, 


electronic 


(4-1) 


Equation (4-1) is a good first approximation and can be systematically corrected by 
the introduction of small interaction terms. 

There are many other problems in physics in which the Hamiltonian, by a proper 
and clever selection of variables, can be written as a sum of individual terms. Although 
these individual terms need not be Hamiltonians for actual individual molecules, they 
are nevertheless used to define the so-called quasi-particles, which mathematically 
behave like independent real particles. Some of the names of quasi-particles that are 
found in the literature are photons, phonons, plasmons, magnons, rotons, and other 
“ ons.” In spite of the apparent limitation of this requirement on the Hamiltonian, we 
can see that it is very useful and can be used to study solids (Chapter 11) and liquids 
(see Chapter 1 2 of Statistical Thermodynamics ), systems in which the decomposition of a 
many-body Hamiltonian into a sum of independent terms would hardly appear to be 
justified. First let us consider the canonical partition function for a system of dis- 
tinguishable particles, in which the Hamiltonian can be written as a sum of individual 
terms. Denote the individual energy states by {e/}, where the superscript denotes the 
particle (they are distinguishable), and the subscript denotes the state. In this case the 
canonical partition function becomes 

Q(N , fJ) = Z e~ Ej/kT = £ c -(«i“+«/+*k c +-)/fcT 

j .j.*.... 

j b /kT £ e -t u c !kT 
i j k 

= q a q b q c ( 4 - 2 ) 

where 


= E 


V kT 


<Z(F,T) = X>“ £i '* r (4-3) 

i 

Equation (4-2) is a very important result. It shows that if we can write the N-particle 
Hamiltonian as a sum of independent terms, and if the particles are distinguishable , 
then the calculation of Q(N , V, T) reduces to a calculation of q(V 9 T). Since q(V, T ) 
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requires a knowledge only of the energy values of an individual particle or quasi- 
particle, its evaluation is quite feasible. In most cases {ej is a set of molecular energy 
states; thus q(V , T) is called a molecular partition function. 

If the energy states of all the particles are the same, then Eq. (4-2) becomes 

2(N, V , T) = [q(V, T)] 1 * (distinguishable particles) (4-4) 

Equation (4-4) shows that the original N-body problem (the evaluation of Q(N, V, T)) 
can be reduced to a one-body problem (the evaluation of q{V , T)) if the particles are 
independent and distinguishable. Although particles are certainly not distinguishable 
in general, there are many important cases where they can be treated as such. An 
excellent example of this is a perfect crystal. In a perfect crystal each atom is confined 
to one and only one lattice point, which we could, in principle, identify by a set of 
three numbers. Since each particle, then, is confined to a lattice point and the lattice 
points are distinguishable, the particles themselves are distinguishable. Furthermore, 
we shall see in Chapter 1 1 that although there are strong intermolecular interactions 
in crystals, we can treat the vibration of each particle about its lattice point as inde- 
pendent to a first approximation. 

Another useful application of the separation indicated in Eq. (4-2) is to the mole- 
cular partition function itself. Equation (4-1) shows that the molecular Hamiltonian 
can be approximated by a sum of Hamiltonians for the various degrees of freedom of 
the molecule. Consequently we get the useful result that 

^molecule (7translational(7rotational(7vibrational(7electronic (4—5) 

where, for example, 

^translational = £ (4-6) 

i 

Thus not only can we reduce an N-body problem to a one-body problem, but it is 
possible to reduce it further into the individual degrees of freedom of the single 
particles. 

Equation (4-4) is an attractive result, but atoms and molecules are, in general, not 
distinguishable; thus the utility of Eq. (4-4) is severely limited. The situation becomes, 
more complicated when the inherent indistinguishability of atoms and molecules is 
considered. In this case, the N-body energy is 

Eijki-.- = £» + Sj + £* + + * • • (4-7) 

and the partition function is 

Q(N, FJ)= £ g -(*,+« i +« k +e I +... ) /fcT ( 4-8) 

*\ J , K l — 

Because the molecules are indistinguishable, one cannot sum over i, j 9 separately 

as we did to get Eq. (4-2). 

Consider, for example, the case of fermions. The antisymmetry of the wave function 
requires that no two identical fermions can occupy the same single-particle energy 
state. Thus in Eq. (4-8), terms in which two or more indices are the same cannot be 
included in the summation. The indices i, /, k 9 /, and so on, are not independent of 
one another, and a direct evaluation of Q(N, V, T ) for fermions by means of Eq. (4-8) 
is very difficult. 

Bosons do not have the restriction that no two can occupy the same molecular state, 
but the summation in Eq. (4-8) is still complicated. Consider a term in Eq. (4-8) in 
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which all of the indices are the same except one, that is, a term of the form 
£; + 8j + £; + £; + -•- with i ^ j. Because the particles are indistinguishable, the 
position of z t is unimportant, and so this state is identical with Zj + e t + £j + £j + • • • 
or Sj + Zj + £i + Zj + Sj + • • • , and so on. Such a state should be included only once 
in Eq. (4-8), but an unrestricted summation over the indices in Eq. (4-8) would 
produce N terms of this type. Consider the other extreme in which all of the particles 

are in different molecular states, that is, the state with energy z ( + Zj + z k -{ with 

Because the particles are indistinguishable, the N! states obtained by 
permuting the N different subscripts are identical and should occur only once in 
Eq. (4-8). Such terms will, of course, appear N\ times in an unrestricted summation. 
Consequently, a direct evaluation of Q for bosons by means of Eq. (4-8) also is 
difficult. 

The terms that introduce complications are those in which two or more indices are 
the same. If it were not for this kind of term, one could carry out the summation in 
Eq. (4-8) in an unrestricted manner, and then correct the sum by dividing by N\ It 
turns out that this procedure yields an excellent approximation in many (most) cases 
for the following reason. 

We showed in Section 1-3 that for a particle in a box, the number of molecular 
quantum states with energy < z is 



8ma 2 e\ 312 


(4-9) 


For m = 10“ 22 g, a = 10 cm, and T = 300°K, <5>(z) = O(10 30 ). Although this calcula- 
tion is done for one particle in a cube (i.e., one molecule of an ideal gas), the order of 
magnitude of the result is general. Thus we see that the number of molecular quantum 
states available to a molecule at room temperature, say, is much greater than the 
number of molecules in the system for all but the most extreme densities. Since each 
particle has many individual states to choose from, it will be a rare event for two 
particles to be in the same molecular state. Therefore the vast majority of terms in 
Eq. (4-8) will have all different indices. This allows us to sum over all the indices 
unrestrictedly and divide by N! to get 






(indistinguishable particles) 


(4-10) 


with 


q(y, T) = Y e~ tl/kT 

j 

for a system of identical, indistinguishable particles satisfying the condition that the 
number of available molecular states is much greater than the number of particles. 

Equation (4-10) is an extremely important result, since it reduces a many-body 
problem to a one-body problem. No longer is there a condition of distinguishability; 
the indistinguishability of the particles has been included by dividing by N!, a valid 
procedure for most systems under most conditions. We can investigate this condition 
in more detail using Eq. (4-9) for an ideal gas. Mathematically, we require that 

<D(£) > N 

Using Eq. (4-9), we have the condition 
n [\2mk 

e\lF 



(4-11) 
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where we have set c = 3kT/2. Clearly this condition is favored by large mass, high 
temperature, and low density. Numerically it turns out that (4-1 1) is satisfied for all 
but the very lightest molecules at very low temperatures. Table 4-1 examines this 
condition for a number of systems. We see that the use of Eq. (4-10) is justified in 
most cases. We have examined (4-1 1) for only monatomic systems, but the results are 
valid for polyatomic molecules as well, since the translational energy states account 
for almost all of the energy states available to any molecule. 

When Eq. (4-10) is valid, that is, when the number of available molecular states is 
much greater than the number of particles in the system, we say that the particles obey 
Boltzmann statistics. Boltzmann statistics is an approximation that becomes increas- 
ingly better at higher temperatures. We shall show in Chapter 7 that at high enough 
temperatures, one can describe the energy of a system by classical mechanics. Since 
the limiting case of Boltzmann statistics and the use of classical mechanics both require 
a high-temperature limit, Boltzmann statistics is also called the classical limit. 

Let us examine Eq. (4-10). The total energy of the N-body system is 


E = Ns = kT 2 ( 

V dT h.v j J 


e -*jl kT 

q 


(4-12) 


The first equality is valid, because the molecules are assumed to be independent, and 
hence their energies are additive. We see from Eq. (4-12) that the average energy of a 
particle is 

e~ BjlkT 

«=!>,• (4-13) 

j <1 


We can conclude from this equation that the probability that a molecule is in the yth 
energy stare 


e~ ejkT e~ £jkT 


(4-14) 


It is interesting to note that the fluctuations in e are of the same order as e itself 
(see Problems 4-18 and 4-19), that is, the probability distribution for single molecules 
is not sharp. A sharp probability distribution is a many-body effect. 


Table 4-1. The quantity (6N/7T V)(h 2 ll2mkT) 312 for a number of simple systems* 



T(°K) 

6N / h 2 \ 3 ' 2 
t tV \12 mkTj 

liquid helium 

4 

1.6 

gaseous helium 

4 

0.11 

gaseous helium 

20 

2.0 x 10- 3 

gaseous helium 

100 

3.5 x IO" 5 

liquid neon 

27 

I.I x 10- 2 

gaseous neon 

27 

8.2 x 10- 5 

gaseous neon 

100 

3.1 x I0-® 

liquid argon 

86 

5.1 x IO" 6 

gaseous argon 

86 

1.6 x 10- 6 

liquid krypton 

127 

5.4 x 10- 5 

gaseous krypton 

127 

2.0 x 10- 7 

electrons in metals 
(sodium) 

300 

1465 


* This quantity must be much less than unity for Eq. (4-10) to be valid. The temperatures associated with 
the liquid states are the normal boiling points [cf. Eq. (4-11)]. 
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The similarity between Eq. (4-14) for molecular states and Eq. (2-12) for states of 
the entire N-body system is not fortuitous. Equation (4-14) can be derived by the same 
mathematical formalism of Chapter 2. The ensemble is considered to be the N actual 
molecules in thermal contact with each other. The number of molecules rij in the state 
with energy £j is found by maximizing a combinatorial factor similar to Eq. (1-77). 
This point of view was the one originally proposed by Boltzmann. It is valid only for 
systems in which the total energy is a sum of individual molecular energies, that is, 
only for dilute gases. The conceptual generalization of these ideas by Gibbs was a 
magnificent achievement, which allowed statistical thermodynamics to be applicable 
to all physical systems. Furthermore, the derivation given in Chapter 2 is rigorous, 
since macroscopic systems can be labeled, and the size of the ensemble can be in- 
creased arbitrarily. This is not so for the Boltzmann approach, since the molecules 
cannot be labeled, and the system is finite. 

Equation (4-14) can be reduced further if we assume that the energy of the molecule 
can be written in the form [cf. Eq. (4-1)] 

£ = e/ rans + c/ 0t + £ fc vib + £ z elec + •♦• 

Then Eq. (4-14) and Eq. (4-5) can be combined to give, for example, 

-r.vib 

n vib = (4-1 5) 

tfvib 

for the probability that a molecule is in the yth vibrational state irrespective of the 
other degrees of freedom. 

Although Eq. (4-10) is applicable to most systems, it is important to complete the 
development of systems of independent, indistinguishable particles by evaluating 
Eq. (4-8) for the general case. The exact evaluation of Eq. (4-8) is necessary for 
several systems that we shall study. We must return, then, to a consideration of the 
effect of the symmetry requirements of N-body wave functions on the sum over states 
in Eq. (4-8). 

4-2 FERMI-DIRAC AND BOSE-EINSTEIN STATISTICS 

There are two cases to consider in the evaluation of Eq. (4-8). The resultant distri- 
bution function in the case of fermions is called Fermi-Dirac statistics, and that in the 
case of bosons is called Bose-Einstein statistics. Since all known particles are either 
fermions or bosons, these two “ statistics ” are the only exact distributions. We shall 
see, however, that in the case of high temperature and/or low density, both of these 
distributions go over into the Boltzmann or classical distribution. 

It is most convenient to treat the general case by means of the grand canonical 
ensemble for reasons that we shall see shortly. Let Ej(N , V) be the energy states avail- 
able to a system containing N molecules. Let e k be the molecular quantum states. 
Finally, let n k = n k (Ej) be the number of molecules in the kth molecular state when the 
system itself is in the quantum state with energy E } . A quantum state of the entire 
system is specified by the set {/?*}. The energy of the system is 

E J = 'Z £ k n k (4-16) 

k 

and, of course, 

N = l"k 

k 


(4-17) 
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We can write Q(N, V, T ) as 

Q(N, K T) = £ e - "*' = £* e - " Ei (4-1 8) 

j ("k> 

where the asterisk in the summation signifies the restriction that 

This restriction turns out to be mathematically awkward. We can avoid this restric- 
tion by using the grand canonical partition function instead. This will be an excellent 
example where one partition function is much easier to evaluate than another. Since 
we have demonstrated the equivalence of ensembles, we are free to make the choice 
strictly on mathematical convenience. We then use 

T,v)=Y e^Q(N, V, T) 

N = 0 

We use Eq. (4-18) for Q(N , V y T ) and the absolute activity X = e fitl to get 


oo 


5 (V,T,n)= £ £* e 

N= 0 {n k > 


-fiTi e f m 


co 

-IE* 


N = 0 {n k > 


tjnj 


= £ £* 

N = 0 {n k } k 


(4-19) 


Now comes the crucial step (which requires some thought). Since we are summing 
over all values of N, each n k will range over all possible values, and Eq. (4-19) can be 
written as (see Problem 4-6) 


m m ®* n 2 m “* 

s (V,T,n)= £ £ (4-20) 

H | = 0 n 2 = 0 k 

Equations (4-19) and (4-20) are completely equivalent. Equation (4-20) can be written 
in a more lucid form: 


H(KT,/i) = 


Bi m « 


B 2 mM 


£ £ (Xe~ 0ci ) ni ■ ■ ■ 

nj = 0 n 2 = 0 


or 


= n Z (4-21) 

k n k = 0 

Equation (4-21) is a simple product and is a general result. The crucial step in this 
series of equations is the step from Eq. (4-19) to Eq. (4-20), from which Eq. (4-21) 
follows immediately. The step from Eqs. (4-19) to (4-20) is possible only because we 
are summing over all values of N y or, in other words, since we are using the grand 
canonical partition function. 

We now apply Eq. (4-21) to fermions and bosons. In Fermi-Dirac statistics, each 
of the n k in Eq. (4-21) can be only either 0 or 1, since no two particles can be in the 
same quantum state. In this case ni™** = 1, and Eq. (4-21) is simply 

“fd = n (1 "*■ Ae~ fiCk ) 

k 

where FD, of course, signifies Fermi-Dirac. 


(4-22) 
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In Bose-Einstein statistics, on the other hand, the n k can be 0, 1,2,..., since there 
is no restriction on the occupancy of each state. Therefore, « k max = oo, and Eq. (4-21) 
becomes 


E be = n Z (te~ Pek T k = n (! “ te~ Ptk Y l te~ p * k < 1 (4-23) 

k nif= 0 k 

To get Eq. (4-23), we have used the fact that 

j = 0 

for x < 1. 

Equations (4-22) and (4-23) are the two fundamental distributions of the statistical 
thermodynamics of systems of independent particles. We can combine these two 
equations into 

Sfd = n 0 ± ^e~ l>Ck ) ±1 (4-24) 

BE k 

where as the notation indicates, the upper sign refers to Fermi-Dirac statistics, and 
the lower sign refers to Bose-Einstein statistics. 

Using Eq. (3-33), we see that 


N = N = £ n k = kT 

k 


k Ofi / V,T \ CA / V,T 


= E 


Xe~ ptk 
1 + Xe~ p ‘ k 


The average number of particles in the /cth quantum state is 
Xe~ fCk 

"* “ 1 ± Xe-o* 


(4-25) 


(4-26) 


Equation (4-26) is the quantum statistical counterpart of Eq. (4-14). We multiply 
Eq. (4-26) by e k and sum over k to get the quantum statistical version of Eq. (4-13). 


E = Ne = n k e k 

k 


y te k e- p * 
T 1 ± 


Lastly, Eq. (3-16) gives 

pV= ±kTY J ^[\±ke~^] 

k 


(4-27) 


(4-28) 


Equations (4-25) through (4-28) are the fundamental formulas of Fermi-Dirac 
(+) and Bose-Einstein (— ) statistics. Note that the molecular partition function q 
is not a relevant quantity when we are dealing with quantum statistics, that is, Fermi- 
Dirac or Bose-Einstein statistics. In spite of the fact that we have neglected inter- 
molecular forces, the individual particles of the system are not independent because 
of the symmetry requirements of the wave functions. 

We noted above that both kinds of statistics should go over into Boltzmann or 
classical statistics in the limit of high temperature or low density, where the number of 
available molecular quantum states is much greater than the number of particles. This 
condition implies that the average number of molecules in any state is very small, since 
most states will be unoccupied and those few states that are occupied will most likely 
contain only one molecule. This means that n k -> 0 in Eq. (4-26). This is achieved by 
letting A->0. Thermodynamically, this means the limit of N/V -► 0 for fixed T, or 
T oo for fixed N/V. (See Problem 4-3.) For small A, Eq. (4-26) becomes 


n k = Ae Pek (A small) 
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If we sum both sides of this equation over k to eliminate A, we have 

n k e~ PCk 
~N ~ ~~q~ 


( 4 - 29 ) 


where 

q = 


( 4 - 30 ) 


Equation (4-26) then goes over to the Boltzmann or classical limit for both Fermi- 
Dirac and Bose-Einstein statistics. 

Equations (4-27) and (4-28) also reduce to the formulas of Section 4-1 as A -► 0. 
Equation (4-27) becomes 

£-►£ Uje-* z * 
j 

and since iij — » we have 


E Y,E,e~ etJ 

p — >. — l 

N Jje-K 


( 4 - 31 ) 


This is the same as Eq. (4-13). Similarly, for small A we can expand the logarithm in 
Eq. (4-28) to get 

pV^(±kT)^±lY i e~ f ' J 'j ( 4 “ 32 ) 

We have used the fact that ln(l + x) « x for small x. Using Eq. (4-30), this becomes 

pV = XkT X e~ = XkTq ( 4 - 33 ) 

j 


or 


PpV = In H = Xq 


( 4 - 34 ) 


Equation (3-33) can be used to show that Xq = N, and so Eq. (4-34) is the perfect gas 
law as expected. Thus the formulas of Fermi-Dirac and Bose-Einstein statistics reduce 
to those of Boltzmann statistics in the classical limits. 

We can also derive Eq. (4-10) directly from Eq. (4-34) for E: 


2 — 


00 


I 


JV = 0 


N\ 


If we compare this to Eq. (3-15), see that 

We shall defer a discussion of the equations of Fermi-Dirac and Bose-Einstein 
statistics to Chapter 10. There are a few systems such as electrons in metals, liquid 
helium, electromagnetic radiation, for which one must use quantum statistics. For 
most systems that we shall study in this book, however, we shall be able to use Boltz- 
mann or classical statistics. In the next chapter we shall apply the limit of Boltzmann 
statistics to the simplest system, namely, a monatomic ideal gas. 
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PROBLEMS 

4-1. Calculate the temperature below which each of the substances listed below cannot be 
treated classically at 1 atmosphere. Compare this with the normal boiling temperature for 
each substance. 

He, Ne, Ar, Kr, C0 2 , N 2 , H 2 , Cl 2 , H 2 0 

4-2. Show that the quantity 

6N / h 2 \ 3 ' 2 
ttV \\2mkT / 

given in Table 4-1 is indeed very large for electrons in metals at room temperature. 

4-3. Show that the condition that A->0 corresponds thermodynamically to the limit 
N/V^O for fixed T, or co for fixed density. Remember that A = e Ptl . 

4-4. In deriving the limiting case of Boltzmann statistics, we claimed that if the number of 
quantum states M far exceeds the number of particles N, then the terms in the product of the 
molecular partition functions in which each particle is in a different quantum state constitute 
the overwhelming number of terms. Show, in fact, that the ratio of this type of term to the 
total number of terms approaches unity as N/M -► 0, N and M both large. Hint : remember 
that 



4-5. For an ideal gas, show that the relation 


2 £kin 

= 3~P~ 


holds irrespective of its statistics, where E k i„ is the total kinetic energy. 

4-6. To convince yourself of the step leading from Eq. (4-19) to Eq. (4-20), consider the 
summation 


s =2 2 *x,"‘x 2 " 2 

N = 0 {«/) 

where n v and n 2 = 0, 1, and 2. Show by directly expanding S for this simple case that this is 
equivalent to [Eq. (4-21)] 

S = (1 + Xk + Xk 2 ) 

K-l 

4-7. Recall that the equation of state for an ideal quantum gas is 
pV = kT\n E = ±kT2 In [1 ± A<r e "* T ] 
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where A = e MkT . Using the fact that the summation over states can be replaced by an integra- 
tion over energy levels 

(2m\ 3/2 

<o(e) de = 27rl — I Ve 1/2 de 

derive the quantum virial expansion 

P_ = 1 f (T1) J A J 

kT ^ A 3 j.i y 5 ' 2 

where A = (/i 2 IrnnkT) 1 ' 2 . 

4-8. Show that the entropy of an ideal quantum gas can be written as 
S — — k 2 [«/ In fij±{ 1 =F «,) In (1 T «j)] 


where the upper (lower) sign denotes Fermi-Dirac (Bose-Einstein) statistics. 

4-9. Show that pV> (N>kT for fermions, and pV< < N>kT for bosons. 

4-10. Consider a system of independent, distinguishable particles, each of which has only 
two accessible states; a ground state of energy 0 and an excited state of energy e. If the system 
is in equilibrium with a heat bath of temperature T, calculate A, E, S , and C v . Sketch C v versus 
T. Does the choice of the ground-state energy — 0 affect P, C„, or SI How would your results 
change if e 0 were added to both energy values? 

4-11. Generalize Eq. (4-10) to the case of a mixture of several different species of non- 
interacting particles. 

4-12. Consider a system of N distinguishable independent particles, each of which can be 
in the state + e 0 or — e 0 . Let the number of particles with energy ±e 0 be N± , so that the 
energy is 

E = N+ eo — N- Eo = 2N+ Eo — Ne 0 


Evaluate the partition function Q by summing exp (—E:kT) over levels and compare your 
result to Q = q N . Do not forget the degeneracy of the levels, which in this case is the number 
of ways that N+ particles out of N can be in the -f state. Calculate and plot the heat capacity 
C v for this system. 

4-13. The vibrational energy levels of a diatomic molecule can be approximated by a 
quantum mechanical harmonic oscillator. The fundamental vibrational frequency v is 
0(10 13 sec -1 ) for many diatomic molecules. Calculate the fraction of molecules in the first 
few vibrational levels in an ideal diatomic gas at 25°C. Derive a closed expression for the 
fraction of molecules in all excited states. 

4-14. The rotational energy of diatomic molecules can be well approximated by a quantum 
mechanical rigid rotor. According to Eq. (1-32), the energy levels depend upon the moment 
of inertia, which for a diatomic molecule is 0(10“ 40 g-cm 2 ). Calculate and plot the population 
of rotational levels of a diatomic ideal gas at 25°C. Do not forget to include the degeneracy 
2J+ 1. 

4-15. Show that Q(N , V,T) = [q{V, T)] N /N\ implies that q(V 9 T) = f(T)V. Do this in both 
the canonical ensemble and grand canonical ensemble formalisms. 

4-16. Show that the most probable distribution of 2N molecules of an ideal gas contained 
in two equal and connected volumes at the same temperature is N molecules in each volume. 

4-17. In Fermi-Dirac statistics, the maximum occupancy of any state is 1, while in Bose- 
Einstein statistics, it is oo. All particles appear to obey one of these two statistics. In 1940, 
however, Gentile* investigated the implications of an intermediate statistics, in which the 
maximum occupancy is m. Derive the distribution law for this case. 


* G. Gentile, Nuovo Cimento 17, p. 493, 1940. 
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4-18. Derive the equation 



for independent particles and show that fluctuations of molecular energies are not at all 
negligible. 

4-19. Starting from Eqs. (4-13) and (4-14), show that the fluctuations in e, the energy of a 
single particle, are not small, and in fact, are given by 



4-20. Consider a gas in equilibrium with the surface of a solid. Some of the molecules of 
the gas will be adsorbed onto the surface, and the number adsorbed will be a function of the 
pressure of the gas. A simple statistical mechanical model for this system is to picture the 
solid surface to be a two-dimensional lattice of M sites. Each of these sites can be either un- 
occupied, or occupied by at most one of the molecules of the gas. Let the partition function 
of an unoccupied site be 1 and that of an occupied site b e#(T). (We do not need to know#(T) 
here.) Assuming that molecules adsorbed onto the lattice sites do not interact with each other, 
the partition function of N molecules adsorbed onto M sites is then 


The binomial coefficient accounts for the number of ways of distributing the N molecules 
over the M sites. By using the fact the adsorbed molecules are in equilibrium with the gas 
phase molecules (considered to be an ideal gas), derive an expression for the fractional 
coverage, 6 = N/M, as a function of the pressure of the gas. Such an expression, that is, 
6(p ), is called an adsorption isotherm, and this model gives the so-called Langmuir adsorption 
isotherm. 

4-21. Consider a lattice of M equivalent noninteracting magnetic dipoles, jz (associated, 
say, with electron or nuclear spins). When placed in a magnetic field H , each dipole can orient 
itself either in the same direction, f, or opposed to, j, the field. The energy of a dipole is —\lH 
if oriented with the field, and + \lH if oriented against the field. Let N be the number of J 
states and M — N the number of f states. For a given value of N, the total energy is 

pHN - pH(M — N) = (2N — M)fxH 


The total magnetic moment I is 
/ = (M — 2N)fM 

where is the average value of N for a given M, H, and 71 The work necessary to increase H 
by dH is — IdH . Find the specific heat C and the total magnetic moment for this system, and 
sketch both I versus \iHjkT , that is, the total magnetization versus the applied field, and 
C/Nk versus kTfoj.H. 

4-22. (a) Consider a system of M independent and distinguishable macromolecules on 
which any number from 0 to m small molecules may bind. Let q(j) be the macromolecular 
partition function when J molecules are bound. If there are N small molecules (or ions) and 
M macromolecules (say proteins), then 


Q(N, M, T) = Z* 

a 


Ml q(0Y°<}(iy i ■ ■ -gim)'" 
a 0 lail 
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where the number of macromolecules having j bound molecules is a Jy and where the asterisk 
indicates the restrictions 

m m 

2 a J=^M ^ jaj = N 

j=o J=0 

Show that the grand partition function for this system can be written in the form 
E(M,7» = f0x, T) M 
where 

fOz, T) =q( 0) + 1)A + • • ■ + q(m) X m 

Interpret this result. 

(b)* Extend this result to the case in which the macromolecules are not distinguishable. 



CHAPTER 5 


IDEAL MONATOMIC GAS 


In this chapter, we shall apply the general results of the preceding chapters to an ideal 
monatomic gas. By ideal, we mean a gas dilute enough that intermolecular interactions 
can be neglected. The results that we derive here will be applicable to real monatomic 
gases at pressures and temperatures for which the equation of state is well represented 
by pV = NkT , that is, pressures below 1 atmosphere and temperatures greater than 
room temperature. 

We have shown in Section 4-1 that the number of available quantum states far 
exceeds the number of particles for an ideal gas. Thus we can write the partition 
function of the entire system in terms of the individual atomic partition functions: 


<2(N, 


V,T) = 


MV, Tjf 

N\ 


(5-1) 


A monatomic gas has translational, electronic, and nuclear degrees of freedom. The 
translational Hamiltonian is separable from the electronic and nuclear degrees of 
freedom, and the electronic and nuclear Hamiltonians are separable to a very good 
approximation. Thus we have 


f) 9trans Select *7nucl 


(5-2) 


We shall study each of these factors separately in the following sections of this chapter. 


5-1 THE TRANSLATIONAL PARTITION FUNCTION 


In this section we shall evaluate the translational partition function. The energy 
states are given by 


h 2 


'n x Hy Rs 


8 ma A 


(n 2 + n y 2 + n 2 ) 


1 > 2 , . . . 


(5-3) 
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We substitute this into q trans to get 



This summation cannot be evaluated in closed form, that is, it cannot be expressed 
in terms of any simple analytic function. This does not present any difficulty, however, 
for the following reason. The successive terms in these summations differ so little from 
each other that the terms vary essentially continuously, and so the summation can, for 
all practical purposes, be replaced by an integral. To prove this, we show that the 
argument of the exponential changes little in going from n x to n x + 1 . This difference, 
A, is given by 

A _ Ph 2 (n x + l) 2 fih 2 n x 1 _ fih 2 (2n x + 1) 

8 ma 2 %ma 2 %ma 2 


At room temperature, for m — 10 22 g and a = 10 cm, this difference is 
A » {2n x + 1) x 10" 20 

A typical value of n x at room temperature is O(10 10 ) (see Problem 5-3), so A is indeed 
very small for all but very large values of n x . A value of n x for which A is as large as 
10“ 5 would correspond to an energy of (10 10 kT), an extremely improbable energy. 
Thus we can replace the summation in Eq. (5-4) by an integration: 

T) - (J V'*™-" 1 *,)’ - (5-5) 

where we have written V for a 3 . 

It is instructive to evaluate q trans in another way. Equation (5-4) is a sum over the 
states of the system. We could also write q tnns as a sum over levels. Recognizing that 
the levels are very densely distributed, we can write q iTans as an integral: 


Qtrans 



(5-6) 


The function co(e) is the number of energy states between e and e + de 9 or, in other 
words, the effective degeneracy. Equation (5-6) is simply a continuous form of a sum 
over levels rather than a sum over states. We have already evaluated co(e) in Section 1-3. 
It is given by Eq. (1-35) 




(5-7) 


If we substitute this into Eq. (5-6), we get 
n /8 mcV /2 r“ 1/2 _ Bt , 

*-■•-4 hr ) 


/2nmkT\ 3/1 

\ h 2 ) 


V 


Of course, we obtain the same result as Eq. (5-5). 


(5-8) 
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The factor ( h 2 /2nmkT ) 1/2 that occurs in the translational partition function has units 
of length and is usually denoted by A. In this notation, Eq. (5-5) or (5-8) read 

V 

Etrans (5—9) 

The quantity A can be given the following interpretation. The average translational 
or kinetic energy of an ideal gas molecule can be calculated immediately from 
Eq. (5-8) and Eq. (4-13), which in terms of < 7 trans is 



We find that et rans = \kT 9 and since e trans = p 2 /2m , where p 2 is the momentum of the 
particle, we can say that the average momentum is essentially (mkT) 1/2 . Thus A is 
essentially h/p , which is equal to the De Broglie wavelength of the particle. Conse- 
quently, A is called the thermal De Broglie wavelength. The condition for the applica- 
bility of classical or Boltzmann statistics is equivalent to the condition that A 3 /F ^ 1, 
which physically says that the thermal De Broglie wavelength must be small compared 
to the dimensions of the container. Such a condition is similar to the condition that 
quantum effects decrease as the De Broglie wavelength becomes small (cf. Table 4-1). 

5-2 THE ELECTRONIC AND NUCLEAR PARTITION FUNCTIONS 

In this section we shall investigate the electronic and nuclear contributions to q. 

It is more convenient to write the electronic partition function as a sum of levels 
rather than a sum over states. We have, then, 

?e (5-10) 

where o ei is the degeneracy, and the energy of the ith electronic level. We first fix 
the arbitrary zero of energy such that e t = 0, that is, we shall measure all of our elec- 
tronic energies relative to the ground state. The electronic contribution to q can then 
be written as 

Select = «*.i + <» e2 + ■■■ (5-1 1 ) 

where A e tj is the energy of the yth electronic level relative to the ground state. These 
Ac’s are typically of the order of electron volts, and so /?Ae is typically quite large at 
ordinary temperatures (see Problem 5-10). Therefore at ordinary temperatures, only 
the first term in the summation for q cicct is significantly different from zero. However, 
there are some cases, such as the halogen atoms, where the first excited state lies only a 
fraction of an electron volt above the ground state, so that several terms in # clcct are 
necessary. Even in these cases the sum converges extremely rapidly. 

The electronic energies of atoms and ions are determined by atomic spectroscopy 
and are well tabulated. The standard reference is the tables of Moore* which list the 
energy levels and energies of many atoms and ions. Table 5-1 lists the first few levels 
for H, He, Li, O, and F. A look at this table will indicate that electronic states are 
labeled or characterized by a so-called term symbol, which is briefly explained in 
Section 5-4. (A knowledge of the meaning of atomic term symbols is not necessary for 
the calculation of q c \ cct , but they are explained in Section 5-4 for completeness.) 


* See Table 5-1. 



84 IDEAL MONATOMIC GAS 


Table 5-1 . Atomic energy states 


atom 

electron 

configuration 

term 

symbol 

degeneracy 
g = 2J+l 

energy (cm -1 ) 

energy (eV) 

H 

1 s 

2 -S./2 

2 

0 

0 


2 P 

2 Pi>2 

2 

82258.907 

10.20 


2 s 

2 S U 2 

2 

82258.942 



2 P 

2 P*n 

4 

82259.272 


He 

Is 2 

L S 0 

1 

0 



\s2s 

3 St 

3 

159850.318 

19.82 



‘5o 

1 

166271.70 


Li 

\s 2 2s 

2 *Sl/2 

2 

0 



\s 2 2p 

2 Pl/2 

2 

14903.66 

1.85 



2 P 3/2 

4 

14904.00 



1s 2 3j 

2 S lt 2 

2 

27206.12 


O 

ls 2 2s 2 2p* 

3 p 2 

5 

0 




3 p 2 

3 

158.5 

0.02 



3 Po 

1 

226.5 

0.03 



1 d 2 

5 

15867.7 

1.97 



'So 

1 

33792.4 

4.19 

F 

\s 2 2s 2 2p s 

2 P*,2 

4 

0 




2 Pl, 2 

2 

404.0 

0.05 


ls 2 2i 2 2p 4 3s 

*Pi,2 

6 

102406.50 

12.70 



*P 3I 2 

4 

102681.24 




*Pll2 

2 

102841.20 




2 P 3/2 

4 

104731.86 




2 Pu2 

2 

105057.10 


Source : 

C. E. Moore, “Atomic Energy States, 

” Natl. Bur. Standards , 

Circ., 1, p. 467, 1949. 


Some general observations about Table 5-1 are: All the rare gases have a ground 
state 1 S 0 (called a singlet 5) with the first excited state 0(10 eV) higher; the alkali 
metals have a 2 *Si/ 2 (called a doublet S) ground state with the next state 0(1 eV) higher; 
the halogen atoms have a 2 P 3/2 (called a doublet P) ground state with the next one, a 
2 P lt 2 (also a doublet P ) only 0(0.1 eV) higher. Thus at ordinary temperatures the 
electronic partition function of the rare gases is essentially unity and that of the alkali 
metals is 2, while those for halogen atoms consist of two terms. 

Using the data in Table 5-1, we can now calculate the fraction of He atoms in the 
lowest triplet state, 3 5' 1 . This fraction is given by 


co e 2 e ~^ 2 

co el + co e2 e~ fiA£l2 + co e3 e- fiM '> + • • • 
3e~ fiMl2 

1 + 3e~ fiAei2 + co e3 + ■ ■ • 


(5-12) 


At 300°K, j3 As 12 =: 770, and so f 2 » 10" 334 . Even at 3000°K, f 2 « 10" 33 . This is 
typical of the rare gases. The energy separation must be less than a few hundred cm -1 
or so before any population of that level is significant. Incidentally, it is useful to know 
that Boltzmann’s constant in units of cm - Vdeg-molecule is 0.695 (almost In 2), and 
1 eV = 8065.73 cm" 1 . 

Table 5-2 gives the fraction of fluorine atoms in the first excited electronic state as 
a function of temperature. It can be seen that fluorine is a case where it is necessary 
to use two terms in q clcc . 

We shall write the electronic partition function as 


q^c{T)K(o ei +o> e2 e fic ' 2 


(5-13) 
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Table 5-2. The fraction of fluorine atoms in the first excited electronic state as a function of temperature 


r(°K) 

h 


0.027 


0.105 

600 

0.160 

800 

0.195 

1000 

0.219 

1200 

0.236 

2000 

0.272 


but at temperatures at which the second term is not negligible with respect to the first 
term, we must check the possible contribution of higher terms as well. This will rarely 
be necessary, however. 

We now consider the nuclear partition function. The nuclear partition function has 
a form similar to that of the electronic partition function. Nuclear energy levels are 
separated by millions of electron volts, however, which means that it requires tem- 
peratures of the order of 10 10o K to produce excited nuclei. At terrestrial temperatures 
then, we need consider only the first term, that is, the degeneracy of the ground nuclear 
state co nl . We take our zero of nuclear energy states to be the ground state. Note that 
we have taken the overall atomic ground state to be the atom in its ground translational, 
electronic, and nuclear states. The nuclear partition function, q n = co nl , then contrib- 
utes only a multiplicative constant to Q, and hence affects only the entropy and free 
energies by a constant additive factor. Since the nuclear state is rarely altered in any 
chemical process, it does not contribute to thermodynamic changes, and so we shall 
usually not include it in q. We cannot do this in the case of the electronic contribution 
since there are many chemical processes in which the electronic states change. 

This completes the partition function of monatomic ideal gases. In summary then, 
we have 


Q = 


((Ztrans (7elec (7nucl) 

N! 


where 

(2nmkT\ 312 V 

?trans — I JjL I * ~ 


q.\^ = 0) el + (o e2 e + 


(5-14) 


<7 nU ci ="„!+••• (5-15) 

The nuclear partition function, although not always equal to unity, is usually omitted. 
We can now calculate thermodynamic properties of a monatomic ideal gas. 


5-3 THERMODYNAMIC FUNCTIONS 
The Helmholtz free energy is given by 

A(N, V, T) = -kT In Q = -NIcT 

— NkT \n(o) el + u) c2 e~ fiAei2 ) (5-1 6) 

The argument of the first logarithm here is much larger than the argument of the 
second logarithm, and so the electronic contribution to A is quite small. 
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The thermodynamic energy is 




NkT + 


Na> e2 As 12 e ** Aci2 

^Zelec 


(5-17) 


The contribution of the electronic degrees of freedom to the energy is small at ordinary 
temperatures (see Problem 5-12). Since we have neglected the contribution of the 
intermolecular potential to the total energy of the gas, the first term of Eq. (5-17) 
represents only kinetic energy. Furthermore, each atom has an average kinetic energy 
ZkTjl , or kT/2 for each degree of translational freedom. We shall give an interesting 
interpretation of this result when we study classical statistical mechanics in Chapter 7. 
If we ignore the very small contribution from the electronic degrees of freedom, the 
molar heat capacity at constant volume is 3Nk/2, a well-known experimental result 
for dilute gases. 

The pressure is 


p = kT 



NkT 

V 


(5-1 8) 


Note that Eq. (5-18) results because q(V 9 T) is of the form /(T)F, and the only contri- 
bution to the pressure is from the translational energy of the atoms. This is what one 
expects intuitively, since the pressure is due to bombardment of the walls of the 
container by the atoms and molecules of the gas. 

The entropy is given by 


5 = 


fNfc + Nfc In 



+ Nkln((o el + (o e2 e- p&ti2 ) 


= Nk In 



+ Selec 


| Nko) e2 M £ 12 e~ fiAti2 

Qclcc 


(5-19) 


(5-20) 


In Eq. (5-20), S clcct denotes the last two terms of Eq. (5-19). Equation (5-20) is called 
the Sackur-Tetrode equation. Table 5-3 compares the results of this equation with 
experimental values for several monatomic gases. 


Table 5-3. Comparison of experimental entropies at 1 atm and T = 298°K to those calculated from the 
statistical thermodynamical equation for the entropy of an ideal monatomic gas* 



exp. 

(e.u.) 

calc. 

(e.u.) 

He 

30.13 

30.11 

Ne 

34.95 

34.94 

Ar 

36.98 

36.97 

Kr 

39.19 

39.18 

Xe 

40.53 

40.52 

C 

37.76 

37.76 

Na 

36.72 

36.70 

A1 

39.30 

39.36 

Ag 

41.32 

41.31 

Hg 

41.8 

41.78 


* The experimental values have been corrected for any nonideal gas behavior. 
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The chemical potential is 

- [(^f -«■ 

K 2nmkT \ 3/2 1 

— a 1 &T - A:T ln^ n + kT In p 

= p 0 (T) + kT In p ( 5 - 21 ) 

where the last line is the thermodynamic equation for p(T, p) for an ideal gas 
[cf. Eq. (1-66)]. Thus statistical thermodynamics yields an expression for fi 0 (X): 

Ho(T)= kT \ ~ kTln ^» ( 5 - 22 ) 

The argument of the first logarithm here has units of pressure, but remember that there 
is a kT\n p term in Eq. (5-21) so that p(T 9 p) itself has units of energy and is O {kT). 


5-4 A DIGRESSION ON ATOMIC TERM SYMBOLS 


The electronic state of an atom is designated by a so-called atomic term symbol. 
Since one encounters atomic term symbols in the calculation of electronic partition 
functions, we discuss them in this section. (The quantum mechanical level of this 
section is above that of most of the book, and it is not necessary to read this section 
on first reading.) 

In addition to the usual kinetic energy and electrostatic terms in the Hamiltonian 
of a many-electron atom, there are a number of magnetic and spin terms. The most 
important of these is the spin-orbit interaction term, which represents the interaction 
of the magnetic moment associated with the spin of the electron with the magnetic 
field generated by the electric current produced by its own orbital motion. There are 
other terms such as spin-spin and orbit-orbit interaction terms, but these are numeri- 
cally much less important. The Hamiltonian can then be written as 


h--£x + 


2m y 


i<j r ij 


( 5 - 23 ) 


where 1 j and s j are the individual electronic orbital and spin angular momenta, 
respectively, and £(rj) is a scalar function of Yj , whose form is not necessary here. We 
can abbreviate this equation by writing 


H = H 0 + H ee + H so ( 5 - 24 ) 

where H 0 represents the first two terms (no interelectronic interactions), H ee the third, 
and H s0 the fourth. 

For light atoms (Z < 40), H so is small enough to be considered a small perturbation. 
If H s0 is neglected altogether, it can be shown that the total orbital angular momentum 
L and the total spin angular momentum S are conserved ( i.e., have “ good ” quantum 
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numbers, or commute with H 0 4- H ee ). However, in this case, the individual orbital 
and spin angular momenta are not conserved; hence they are not useful concepts. 
The eigenvalues of the square of the total orbital angular momentum operator L 2 
and the square of the total spin angular momentum operator 5 2 are L{L 4- 1 )h 2 and 
S(S 4- 1 )ft 2 , respectively. One often interprets these eigenvalues by saying that the 
orbital angular momentum has the value L or that the total spin is 5, but it should 
always be borne in mind that the orbital or spin angular momentum itself is not an 
eigenoperator in quantum mechanics. 

The quantities L and S are the vector sums of the individual orbital ly and spin Sy, 
angular momenta. The possible ways of adding these ly or s y are governed by quantum 
mechanics with the result being that only certain values of the quantum numbers L 
and S are allowed. In the case of two electrons, L can take on only the values 
1 4 + 1 2 > li + I2 ““ 1. | li — 1 2 |, with a similar result for S. What we really mean 

by this, of course, is that the only allowed values of the eigenvalues of L 2 are 

(li + I2XI1 + 12 + W 2 , Oi + U - I) 0 i + 1 2 )* 2 , (I li - I2IXI li 7 I2I + I)* 2 - 

The addition of electronic angular momenta to obtain L for cases involving more than 
two electrons can be accomplished using a scheme which is a straightforward but 
rather tedious electron-by-electron extension of the above two electron systems. 
Actually, rather specialized and advanced techniques have been developed to handle 
this problem, but we need not be concerned with them here. 

The electronic energy states are designated by a term symbol, part of which is given 
by 2S+1 L. Terms with L = 0, 1, 2, . . . are denoted by S, P, D, 

When the spin-orbit term is taken into account, L and S are no longer conserved 
(that is, do not commute with the total H) 9 and only the total angular momentum, 
J = L 4- S, is conserved. The eigenvalues of J 2 = (t 4- §) 2 are J(J 4- 1 )ft 2 , with a 
degeneracy 2J 4- 1, corresponding to the 2 J 4- 1 eigenvalues of J z , namely, 
Jh 9 (J — 1 )h, . . . , — Jh. Just as in the addition of l t and 1 2 above, the allowed values 

of J are L + 5, L 4- 5 - 1 | L — S| . The spin-orbit term causes each of these 

values of J to have a slightly different energy, and so the value of J is included in the 
term symbol as a subscript to give 2S+1 Lj as a characterization of the electronic state 
of an atom. 

Table 5^1 lists the first few electronic states for some of the first row atoms. 

This light atom approximate coupling scheme, in which L and S are almost good 
quantum numbers (that is, not good quantum numbers because of the small spin-orbit 
perturbation term) and in which the total angular momentum J is found by adding 
L and S, is called Russell-Saunders or L-S coupling. As the atomic number of the 
atom becomes larger, the spin-orbit term becomes larger than the interelectronic 
repulsion term, and H ee can be considered to be a small perturbation on the others. In 
this case L and S are no longer useful, and the individual total angular momenta, 
ji = 1 £ 4- S| , become the approximately conserved quantities. One then couples the 
j’s to get the total angular momentum. This scheme is called j-j coupling and is 
applicable to heavier atoms. In spite of the deterioration of L~S coupling as Z increases, 
it is still approximately useful, and so the electronic states of even heavy atoms are 
designated by term symbols of the form 2S+1 Lj . 

We are ready to discuss an ideal gas of diatomic molecules. In addition to having 
translational and electronic degrees of freedom, diatomic molecules have rotational 
and vibrational degrees of freedom as well. It should be apparent at this point that the 
additional input into our statistical thermodynamical equations will be the rotation- 
vibration energy levels. 
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We could also leave the study of gases for a while and apply our general results to 
other systems such as solids and liquids. This would involve no more effort than 
continuing with ideal gases and perhaps would be a change of pace. We could go 
directly to Chapter 11, for example, but we shall finish gases before treating other 
systems. 
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PROBLEMS 

5-1. Convert Boltzmann’s constant k = 1.3806 x 10 ~ i6 ergs-molecule ~ 1 deg -1 to cm -1 - 
molecule -1 -deg _1 and to eV-molecule " 1 -deg " 1 . 

5-2. By considering the special case of an ideal gas, determine the order of magnitude of 
E, A, G y S y Cy , and jx. Express your answers in terms of N, kT y or Nk, whichever is ap- 
propriate. 

5-3. Calculate the value of n X9 n yt and n 2 for the case n x = n y = n x for a hydrogen atom 
(atomic weight 1.00) in a box of dimensions 1 cc if the particle has a kinetic energy 3AT/2, 
for r=27°C. What significant fact does this calculation illustrate? 

5-4. Calculate the entropy of Ne at 300 K and 1 atmosphere. Use the entropy, in turn, to 
estimate the “ translational ” degeneracy of the gas. 

5-5. Calculate the entropy of 1 mole of argon at 298 "TC and 1 atm and compare this to 
Table 5-3. 

5-6. The quantum mechanical energy of a particle confined to a rectangular parallelepiped 
of lengths a 9 b t and c is 

(nS nS\ 

“8 m\a 2 + b 2 + c 2 j 

Show that the translational partition function for this geometry is the same as that of a cube 
of the same volume. 

5-7. Given that the quantum mechanical energy levels of a particle in a two-dimensional 
box are 

h 2 

e = S^ (Sx2 + SyI) = 

First calculate the density of states t*>(e) de, that is, the number of states between e and e de, 
and use this to find the translational partition function of a two-dimensional ideal gas. Then 
find the partition function by another method. And finally find the equation of state, the 
thermodynamic energy E, the heat capacity C A , and the entropy. This is a model for a gas 
adsorbed onto a surface or for long-chain fatty acids on the surface of water, say, as long as 
the number of molecules per unit area is small enough. 

5-8. Calculate the entropy of a mixture of 50 percent neon and 50 percent argon at 500°K 
and 10 atm, assuming ideal behavior. 
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5-9, Calculate the De Broglie wavelength of an argon atom at 25°C and compare this with 
the average interatomic spacing at 1 atm. 

5-10. Evaluate j3Ae e ie C at room temperature, given that electronic energy levels are usually 
separated by energies of the order of electron volts. 

5-11. Using the data in Table 5-1, calculate the population of the first few electronic 
energy levels of an oxygen atom at room temperature. 

5-12. Show that the contribution of the electronic degrees of freedom to the total energy 
is small at ordinary temperatures [cf Eq. (5-17)]. 

5-13. Generalize the results of this chapter to an ideal binary mixture. In particular, show 
that 


<h Nl q2 N2 
Q ~ Ni\N 2 \ 

E=i(N 1 + N 2 )kT 

and 


/ Ve s ' 2 \ 


N 2 k\n 



if we ignore q *iec and q n ud . 

5-14. Derive the standard thermodynamic formula for the entropy of mixing by starting 
with Eq. (5-20). 

5-15. Calculate A, E, /x, C v , and S for 1 mole of Kr at 25°C and 1 atm (assuming ideal 
behavior). 

5-16. Show that the most probable distribution of 2N molecules of an ideal gas contained 
in two equal and connected volumes at the same temperature is N molecules in each volume. 

5-17. Evaluate the isothermal-isobaric partition function of a monatomic ideal gas by 
converting the summation over V in Eq. (3-17) to an integral. The result is 


tkT\ N 


Using the fact that G — —kT In A, derive expressions for S and V. 

5-18. Consider a monatomic ideal gas of N particles in a volume V. Show that the number 
n of particles in some small subvolume v is given by the Poisson distribution 


Pn = (A?)" 




n\ 



Hint: Use the grand canonical ensemble and particularly the result that E = exp (A#). 
5-19. Calculate q c i ec for a hydrogen atom. The energy levels are given by 


£,= 


27 r 2 me A 
n 2 h 2 


« = 1 , 2 ,... 


and the degeneracy is 2n 2 . How is this seemingly paradoxical result explained? (See S. J. 
Strickler, J. Chem. Educ 43, p. 364, 1966.) 



CHAPTER 6 


IDEAL DIATOMIC GAS 


In this chapter we shall treat an ideal gas composed of diatomic molecules. In addition 
to translational and electronic degrees of freedom, diatomic molecules possess vibra- 
tional and rotational degrees of freedom as well. The general procedure would be to 
set up the Schrodinger equation for two nuclei and /z-electrons and to solve this equa- 
tion for the set of eigenvalues of the diatomic molecule. Such a general exact approach 
is very difficult and has been done only for H 2 . Fortunately, a series of very good 
approximations can be used to reduce this complicated two-nuclei, /z-electron problem 
to a set of simpler problems. The simplest of these approximations is the rigid rotor- 
harmonic oscillator approximation. In Section 6-1 we shall discuss this approxima- 
tion, and then in Sections 6-2 and 6-3 we discuss the vibrational and rotational 
partition functions within this approximation. Section 6-4 is a discussion of the sym- 
metry of the wave functions of homonuclear diatomic molecules under the interchange 
of the two nuclei, and Section 6-5 is an application of these results to the rotational 
partition function of homonuclear diatomic molecules. This section contains a detailed 
discussion of ortho- and para-hydrogen. Section 6-6 summarizes the thermodynamic 
functions under the rigid rotor-harmonic oscillator approximation. 

6-1 THE RIGID ROTOR-HARMONIC OSCILLATOR APPROXIMATION 

We first make the Born-Oppenheimer approximation. The physical basis of the 
Born-Oppenheimer approximation is that the nuclei are much more massive than the 
electrons, and thus move slowly relative to the electrons. Therefore the electrons can 
be considered to move in a field produced by the nuclei fixed at some internuclear 
separation. Mathematically, the Schrodinger equation approximately separates into 
two simpler equations. One equation describes the motion of the electrons in the field 



92 


IDEAL DIATOMIC GAS 


of the fixed nuclei. We denote the eigenvalues of this equation by w/r), where r is the 
intemuclear separation. The other equation describes the motion of the nuclei in the 
electronic potential w/r), that is, the potential set up by the electrons in the electronic 
state j. Each electronic state of the molecule creates its own characteristic internuclear 
potential. As in the atomic case, the first excited electronic state usually lies several 
electron volts above the ground state, and so only the ground electronic state potential 
is necessary. The calculation of Uj(r) for even the ground state is a difficult ^-electron 
calculation, and so semiempirical approximations such as the Morse potential are 
often used. Figure 6-1 illustrates a typical internuclear potential. Given w 0 (r), we treat 
the motion of the two nuclei in this potential. 

Problem 1-8 shows that the motion of two masses in a spherically symmetric poten- 
tial can be rigorously separated into two separate problems by the introduction of 
center of mass and relative coordinates. The center-of-mass motion is that of a freely 



Figure 6-1 . The intemuclear potential energy curve for the ground state of I 2 as computed from ultraviolet 
spectroscopy. The dashed curve is the Morse curve. (From R. D. Verma, J. Chem. Phys . , 
32, 738, 1960.) 
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translating point of mass m x + m 2 situated at the center of mass. The other problem 
is that of the relative motion of the two bodies, which can be interpreted as one body 
of reduced mass fi = m x m 2 l{m x + jw 2 ) moving about the other one fixed at the origin. 
The Hamiltonian can be written as 


trans » ini 

with eigenvalues 

= ^ trans "b e im 

The partition function of the diatomic molecule, therefore, is 

Q (7trans(7inl 

where 

r2jt(/n, + m 2 )kT ~\ 3/2 

^trans ~ ^2 J ^ 

The density of translational states alone is so great that we can write 
Q(N,V,T)= q " n ™f Bt 


( 6 - 1 ) 


( 6 - 2 ) 


Thus we must investigate q int to complete our treatment of diatomic molecules. 

The relative motion of the two nuclei in the potential u[r) consists of rotary motion 
about the center of mass and relative vibratory motion of the two nuclei. It turns out 
that the amplitude of the vibratory motion is very small, and so it is a good approxi- 
mation to consider the angular motion to be that of a rigid dumbbell of fixed inter- 
nuclear distance r e . In addition, the internuclear potential u(r) can be expanded 
about r e : 

« r) = u(r - } +(r ~ '•> + i<r - + ■ ■ ■ 

= u(r e ) + \k(r - r e ) 2 + ■ ■ • (6-3) 

The linear term vanishes because (du dr) is zero at the minimum of u(r). The parameter 
A: is a measure of the curvature of the potential at the minimum and is called the force 
constant. A large value of k implies a stiff bond; a small value implies a loose bond. 

The approximation introduced in the previous paragraph is called the rigid rotor- 
harmonic oscillator approximation. It allows the Hamiltonian of the relative motion 
of the nuclei to be written as 

•^rot,vib = 3f r ot + -^vib (rigid rotor-harmonic oscillator (6-4) 

approximation) 

and 


fi rot,vib e rot "b C vib (6—1 5) 

The partition function # rottVib , then, is 

4rot,vib = flrotflvib (rigid rotor-harmonic oscillator approximation) (6-6) 
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The energy eigenvalues and the degeneracy of a rigid rotor are given in Eq. (1-32) 
_ h 2 J(J + 1) 

£j ~ 2/ J = 0,1,2, 

(Oj = 2J + 1 (6-7) 

where / is the moment of inertia, \xr 2 9 of the molecule. The energy and degeneracy of 
an harmonic oscillator are [cf. Eq. (1-31)] 


e V ib = hv(n + i) 
co n = 1 for all n 


" = 0 , 1 , 2 ,.. 


where 


v 


271 



1/2 


( 6 - 8 ) 


( 6 - 9 ) 


Transitions from one rotational level to another can be induced by electromagnetic 
radiation. The selection rules for this are: (1) The molecule must have a permanent 
dipole moment, and (2) A J = ±1. The frequency of radiation absorbed in the process 
of going from a level J to J + 1 is given by 


v = 


fi j+i ~ 8 j 


4n z I 


(J + 1) J = 0, 1, 2, . . . 


( 6 - 10 ) 


We thus expect absorption of radiation at frequencies given by multiples of h/4n 2 I 
and should observe a set of equally spaced spectral lines, which for typical molecular 
values of /x and r 2 will be found in the microwave region. Experimentally one does 
see a series of almost equally spaced lines in the microwave spectra of linear molecules. 
The usual units of frequency in this region are wave numbers, or reciprocal wave- 
lengths. 


<d ( cm 0 = t = - 


(6-1 1 ) 


Microwave spectroscopists define the rotational constant B by h\%n 2 Ic (units of 
cm' 1 ), so that the energy of rigid rotor (in cm' 1 ) becomes 


sj = BJ(J + 1) 


( 6 - 12 ) 


Table 6-1 lists the values of B for several diatomic molecules. 

For a molecule to change its vibrational state by absorbing radiation it must (1) 
change its dipole moment when vibrating and (2) obey the selection rule A n= ±1. The 
frequency of absorption is, then, seen to be 


s n + l ~ £ n __ _J_ \ lf 2 

h In \ji) 


( 6 - 13 ) 


Equation (6-13) predicts that the vibrational spectrum of a diatomic molecule will 
consist of just one line. This line occurs in the infrared, typically around 1000 cm' 1 , 
giving force constants k of the order of 10 5 or 10 6 dynes/cm. (See Problem 6-5.) 
Table 6-1 gives the force constants of a number of diatomic molecules. 



THE RIGID ROTOR-HARMONIC OSCILLATOR APPROXIMATION 


95 


Table 6-1 . Molecular constants for several diatomic molecules* 


molecule 

electronic 

state 

w 

(cm -1 ) 

©. 

CK) 

B 

(cm -1 ) 

(K) 

k x 10- 5 
(dynes/cm) 

D 0 

(kcal/mole) 

h 2 

‘Z/ 

4320 

6215 

59.3 

85.3 

5.5 

103.2 

d 2 


3054 

4394 

29.9 

42.7 

5.5 

104.6 

Cl 2 


561 

808 

0.244 

0.351 

3.2 

57.1 

Br 2 


322 

463 

0.0809 

0.116 

2.4 

45.4 

I 2 

‘Z. + 

214 

308 

0.0373 

0.0537 

1.7 

35.6 

o 2 

3 Z.- 

1568 

2256 

1.437 

2.07 

11.6 

118.0 

n 2 


2345 

3374 

2.001 

2.88 

22.6 

225.1 

CO 


2157 

3103 

1.925 

2.77 

18.7 

255.8 

NO 

2 Ol/* 

1890 

2719 

1.695 

2.45 

15.7 

150.0 

HC1 

T + 

2938 

4227 

10.44 

15.02 

4.9 

102.2 

HBr 


2640 

3787 

8.36 

12.02 

3.9 

82.4 

HI 


2270 

3266 

6.46 

9.06 

3.0 

70.5 

Na 2 


159 

229 

0.154 

0.221 

0.17 

17.3 

k 2 

■z. + 

92.3 

133 

0.0561 

0.081 

0.10 

15.8 


* These parameters were obtained from a variety of sources and do not necessarily represent the most 
accurate values since they are obtained under the rigid rotor-harmonic oscillator approximation. 


We furthermore assume that the electronic and nuclear degrees of freedom can be 
written separately, and thus we have 

3 %* = ^trans ^rot + ^ vib + ^ elcc + ^ nucl (6-14) 

which implies that 


£ ^trans 4" ^rot 4“ ^vib 4“ ^elec 4“ ^nucl 


(6-15) 


and 


Q trails (Zrot (Zvib (Zelec (Znucl 


(6-16) 


The translational partition function is given by Eq. (6-1); the electronic partition 
function will be similar to Eq. (5-1 1) for a monatomic gas; and we shall usually adopt 
the convention that q nucl = 1. Although Eq. (6-14) is not exact, it serves as a useful 
approximation. Within this approximation, the partition function of the gas itself is 
given by 


Q(N y V, T) = 


((Ztrans (Zrot (Zvib (7 elec (Znucl) 


N\ 


( 6 — 1 7 ) 


We shall introduce several corrections to Eqs. (6-14) through (6-17) in Problems 6-23 
and 6-24. 

Only the vibrational and rotational contributions to the partition function are not 
known in Eq. (6-17), and we shall discuss these contributions in the next few sections. 
Before discussing these, however, we must choose a zero of energy for the rotational 
and vibrational states. The zero of rotational energy will usually be taken to be the 
J = 0 state. In the vibrational case we have two choices. One is to take the zero of 
vibrational energy to be that of the ground state, and the other is to take the zero to be 
the bottom of the intemuclear potential well. In the first case, the energy of the ground 
vibrational state is zero, and in the second case it is hv/ 2. We shall choose the zero of 
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Figure 6-2. The ground and first excited electronic states as a function of the internuclear separation 
r, illustrating the quantities j D 0 , D ey and e 2 . 

vibrational energy to be the bottom of the internuclear potential well of the lowest 
electronic state. Lastly, we take the zero of the electronic energy to be the separated, 
electronically unexcited atoms at rest. If we denote the depth of the ground electronic 
state potential well by D e , the energy of the ground electronic state is —D ey and the 
electronic partition function is 

tfelec = 0>ei e DJkT + co e2 e~ tl,kT + • • • (6-18) 

where D e and e 2 are shown in Fig. 6-2. We also define a quantity D 0 by D e — \h\. As 
Fig. 6-2 shows, D 0 is the energy difference between the lowest vibrational state and the 
dissociated molecule. The quantity D 0 can be measured spectroscopically (by pre- 
dissociation spectra, for example) or calorimetrically from the heat of reaction at any 
one temperature and the heat capacities from 0°K to that temperature. The values of 
D 0 for a number of diatomic molecules are given in Table 6-1. 


6-2 THE VIBRATIONAL PARTITION FUNCTION 

Since we are measuring the vibrational energy levels relative to the bottom of the 
internuclear potential well, we have 

e„ = (n + i)hv n = 0, 1, 2, . . . (6-1 9) 

with v = (kfri) l,2 !2n 9 where k is the force constant of the molecule, and fi is its reduced 
mass [cf. Eq. (6-9)]. The vibrational partition function q vib , then, becomes 

<7vi b (X) = X e-** 

n 

_ e ~0W2 ^ g—fihvn 
n=0 
e ~fihvf2 


1 - e- fihv 


( 6 - 20 ) 
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where we have recognized the summation above as a geometric series. This is one of 
the rare cases in which q can be summed directly without having to approximate it by 
an integral, as we did in the translational case in Chapter 5 and shall do shortly in the 
rotational case. The quantity phv is ordinarily larger than 1, but if the temperature is 
high enough, phv < 1, and we can replace the sum in Eq. (6-20) by an integral to get 

q vib (T) = e-W 2 F e~ fihvn dn = — ( kT > hv) (6-21 ) 

■*0 hv 


which we see is what results from Eq. (6-20) if phv < 1. Although we shall rarely 
use this approximation since we have q vib (T) exactly, it will be interesting to compare 
this limit to some others which we shall derive later on. From q vib (T) we can calculate 
the vibrational contribution to the thermodynamic energy 


E v = NkT 2 


d\nq v 

dT 




( 6 - 22 ) 


where ©„ = hvjk and is called the vibrational temperature. Table 6-1 gives ©„ for a 
number of diatomic molecules. The vibrational contribution to the heat capacity is 



(6-23) 


Notice that as T -► oo, E v -► NkT and C v -► Nk> a result given in many physical 
chemistry courses and one whose significance we shall understand more fully when we 
discuss equipartition of energy. 

Figure 6-3 shows the vibrational contribution of an ideal diatomic gas to the molar 
heat capacity as a function of temperature. 

An interesting quantity to calculate is the fraction of molecules in excited vibrational 
states. The fraction of molecules in the vibrational state designated by n is 


/„ = 


e -fihv( n +lf2 ) 


^Zvib 


(6-24) 


This equation is shown in Fig. 6-4 for Br 2 at 300°K. Notice that most molecules are 
in the ground vibrational state and that the population of the higher vibrational states 



Figure 6-3. The vibrational contribution of an ideal diatomic gas to the molar heat capacity as a function 
of temperature. Room temperature is typically (XO.lGuXC/. Table 6-1). 
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Figure 6-4. The population of the vibrational levels of Br 2 at 300°K. 


decreases exponentially. Bromine has a force constant smaller than most molecules, 
however (c/. Table 6-1), and so the population of excited vibrational levels of Br 2 is 
greater than most other molecules. Table 6-2 gives the fraction of molecules in all 
excited states for a number of molecules. This fraction is given by 

oo p -fihv{n+lJ2) 

/„> 0 = E 1 -fo = e~ fihv = e"®* 77 ' (6-25) 

n— 1 (Zvib 

Table 6-2. The fraction of molecules in excited vibrational states at 300°K and 1000°K 


gas 

©i>. °K 

e 

300°K 

-e v /t 

1000°K 

h 2 

6215 

1.04 x 10" 9 

2.03 x 10" 3 

HC1 

4227 

1.02 X 10"« 

1.59 x 10" 2 

N 2 

3374 

1.51 x 10" 5 

3.55 X lO" 2 

CO 

3100 

3.71 x 10" 5 

4.65 x 10" 2 

Cla 

810 

6.72 x 10~ 2 

4.45 x 10-* 

la 

310 

3.56 x 10" 1 

7.33 x 10" * 


6-3 THE ROTATIONAL PARTITION FUNCTION OF A 
HETERONUCLEAR DIATOMIC MOLECULE 

For heteronuclear diatomic molecules, the calculation of the rotational partition 
function is straightforward. The rotational partition function is given by 

4,0, CT) = i(2J+ I (6 _ 26) 

J = 0 

In the nomenclature of Chapter 2, Eq. (6-26) is a summation over levels rather than 
over states. 
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The ratio B k is denoted by © r and is called the characteristic temperature of rota- 
tion. This is given in Table 6-1 for a number of molecules. Unlike the vibrational case, 
this sum cannot be written in closed form. However, because © r /J is quite small at 
ordinary temperatures for most molecules, we can approximate this sum by an 
integral. (It is really A elkT = 2 © r (J 4- 1)'T that must be small compared to one, and 
this of course cannot be true as J increases. However, by the time J is large enough to 
contradict this, the terms are so small that it makes no difference.) 

At high enough temperatures, then, 

q rot (T) = f°°(2 J + l)e _0rJ</+ 1)/T dJ (6-27) 

J o 


= Ce~ e ' J < J+i > IT d{J(J + 1)} = J 

J 0 ©r 


87 z 2 lkT 
h 2 


e r <T 


(6-28) 

(6-29) 


This result improves as the temperature increases and is called the high-temperature 
limit. For low temperatures or for molecules with large values of © r , say HD with 
© r = 42.7°K, one can use the sum directly. For example, 

q ro t(T) = 1 + 3e" 20r/T + 5e~ 60r/T + 7c" 120r/T (6-30) 

is sufficient to give the sum to within 0.1 percent for © r > 0.7 T. For © r less than 0.7 T 
but not small enough for the integral to give a good approximation, we need some 
intermediate approximation. 

The replacement of a sum by an integral can be viewed as the first of a sequence of 
approximations. The full scheme is a standard result of the field of the calculus of 
finite differences and is called the Euler-MacLaurin summation formula. It states that 
if f(n) is a function defined on the integers and continuous in between, then 

fmdn+xm+m} 

J a 

+ t(-y$r, {/ (2i_ °(«) -f'v-'Kb)} ( 6 - 31 ) 

j=i ( 2 /;! 

where / (k) (a) is the fcth derivative of / evaluated at a. The B/s are the Bernoulli 
numbers, B t = B 2 = ^o, B 3 = Before applying this to q tot {T), let us apply 

it first to a case we can do exactly. Consider the sum [cf. Eq. (6-20)] 



00 



1 


1 -e~ a 


Applying the Euler-MacLaurin summation formula, we get 

£ —a i 1 1 a a 3 

,5o C = a + 2 + i2 _ 720 + ‘" 

The expansion of (1 — e _ “) _1 is 

1 1 1 1 a a 3 

a 2 a 3 ~ a + 2 + 12 ~ 720 + 

T - + T 


(6-32) 


(6-33) 


1 — e~ a 


(6-34) 
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We see that these two expansions are the same. If a is large, we can use the first few 
terms of Eq. (6-33); otherwise, we use the Euler-MacLaurin expansion in a. 
Applying this formula to q rot (T) gives (see Problem 6-9): 



(6-35) 


which is good to within one percent for © r < T. For simplicity we shall use only the 
high-temperature limit in what we do here since © r is T for most molecules at room 
temperature (cf. Table 6-1). 

The rotational contribution to the thermodynamic energy is 


£ rot = NkT 2 ^^ = NkT + ••• 


(6-36) 


and the contribution to the heat capacity is 

C Vt r 0t = Nk + • • • (6-37) 

The fraction of molecules in the Jth rotational state is 


Nj (2J + l)e~ 0rJ(/+1)/r 
~ q rot (T) 


(6-38) 


Figure 6-5 shows this fraction for HC1 at 300°K. Contrary to the vibrational case, 
most molecules are in excited rotational levels at ordinary temperatures. We can find 
the maximum of this curve by differentiating Eq. (6-38) with respect to J to get 



We see then that increases with T and is inversely related to S, and so increases 
with the moment of inertia of the molecule since B oc 1 /. 

The next two sections deal with the rotational partition function of homonuclear 
diatomic molecules. The wave function of a homonuclear diatomic molecule must 
possess a certain symmetry with respect to the interchange of the two identical nuclei 
in the molecule. In particular, if the two nuclei have integral spins, the wave function 
must be symmetric with respect to an interchange; if the nuclei have half odd integer 



Figure 6-5. The population of the rotational levels of hydrogen chloride at 300 D K. 
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spin, the wave function must be antisymmetric. This symmetry requirement has a 
profound effect on the rotational energy levels of a homonuclear diatomic molecule, 
which can be understood only by understanding the general symmetry properties of a 
homonuclear diatomic molecule. This is discussed in the next section. Then in 
Section 6-5 we apply these results to the rotational partition function. We shall see 
there that at low temperatures, the symmetry properties have an important effect on 
the thermodynamic properties of certain molecules, and in particular we shall discuss 
ortho- and para-hydrogen. 

The discussion of the symmetry requirement is somewhat involved, however, and 
so we present here a summary of the high-temperature limit, which for most systems is 
completely adequate. At temperatures such that the summation in Eq. (6-26) can be 
approximated by an integral or even the Euler-MacLaurin expansion, the result for a 
homonuclear diatomic molecule is 


- 2 © r \ l + 3T + 15 



( 6 - 39 ) 


Note that this equation is the same as Eq. (6-35) except for the factor of 2 in the 
denominator. This factor is due to the symmetry of the homonuclear diatomic molecule 
and, in particular, is due to the fact that there are two indistinguishable orientations of 
a homonuclear diatomic molecule. There is a two-fold axis of symmetry perpendicular 
to the internuclear axis. 

Equations (6-35) and (6-39) can be written as one equation by introducing a factor 
g into the denominator of Eq. (6-35). If a = 1, we have Eq. (6-35), and if o = 2, we 
have Eq. (6-39). The factor a is called the symmetry number of the molecule and 
represents the number of indistinguishable orientations that the molecule can have. 
The exact origin of a can only be completely understood from the arguments presented 
in Sections 6-4 and 6-5, but on first reading it is possible to accept the factor of o 
and proceed directly to Eq. (6-47) from here. 


6-4 THE SYMMETRY REQUIREMENT OF THE TOTAL WAVE 
FUNCTION OF A HOMONUCLEAR DIATOMIC MOLECULE 

The calculation of the rotational partition function is not quite so straightforward 
for homonuclear diatomic molecules. The total wave function of the molecule, that is, 
the electronic, vibrational, rotational, translational, and nuclear wave function, must 
be either symmetric or antisymmetric under the interchange of the two identical 
nuclei. It must be symmetric if the nuclei have integral spins (bosons), or antisymmetric 
if they have half-integral spins (fermions). This symmetry requirement has profound 
consequences on the thermodynamic properties of homonuclear diatomic molecules 
at low temperatures. We shall discuss the interchange of the two identical nuclei of a 
homonuclear diatomic molecule in this section, and then apply the results to the 
calculation of q TOt in the next section. 

It is convenient to imagine this interchange as a result of (1) an inversion of all the 
particles, electrons and nuclei, through the origin, and then (2) an inversion of just 
the electrons back through the origin. This two-step process is equivalent to an ex- 
change of the nuclei. Let us write ^ tota , exclusive of the nuclear part as 

^ total = ^trans^rot^vib^elec 

where the prime on ^ tota , indicates that we are ignoring the nuclear contribution for 
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now. The translational wave function depends only upon the coordinates of the center 
of mass of the molecule, and so this factor is not affected by inversion. Furthermore, 
tf'vib depends only upon the magnitude of (r — r e ), and so this part of the total wave 
function is unaffected by any inversion operation. Therefore, we concentrate on ij/ clcc 
and ^ rot . 

The property of \j/ clcc under the inversions in both Steps (1) and (2) above depends 
upon the symmetry of the ground electronic state of the molecule. The ground elec- 
tronic state of most molecules is symmetric under both of these operations. Such a 
state is designated by the term symbol . Thus it is ^ rot that controls the symmetry of 

^total * 

Only Step (1) above, the inversion of both electrons and nuclei through the origin, 
affects ^ rot . The effect of this inversion is to change the coordinates (r, 0, <j>) that 
describe the orientation of the diatomic molecule into (r, n — 0, </> 4- n). One can see 
this either analytically from the eigenfunctions themselves or pictorially from the 
rotational wave functions shown in Fig. 6-6. Notice that the rigid rotor wave functions 
are the same functions as the angular functions of the hydrogen atom. 

The net result then, when the ground electronic state is symmetric, that is, 
is that ^ total remains unchanged for even J and changes sign for odd J. This result 
applies to the total wave function, exclusive of nuclear spin. 

Now consider a molecule such as H 2 , whose nuclei have a spin of Just as in the 
case of the two electrons in the helium atom, the two nuclei of spin \ have three sym- 
metric spin functions aa, /?/?, and 2“ 1/2 (a/? -I- /fa), and one antisymmetric spin function 
2~ 1/2 (aft — /fa). Since nuclei with spin \ act as fermions, the total wave function must 
be antisymmetric in the exchange of these two nuclei. Now states with both even and 
odd values of J can be brought to the required antisymmetry by coupling them with 
the right spin functions. Since three symmetric nuclear spin functions can be combined 
with the odd J levels to achieve the correct overall antisymmetry for *£+ electronic 
states, we see that the odd J levels have a statistical weight of 3, compared to a weight of 
1 for even J levels. This leads to the existence of ortho- (parallel nuclear spins) states and 
para- (opposed nuclear spins) states in H 2 . This weighting of the rotational states will be 
seen shortly to have a profound effect on the low-temperature thermodynamics of H 2 . 

More generally, for nuclei of spin /, there are 2J + 1 spin states for each nucleus. 
Let the eigenfunctions of these spin states be denoted by oq, <x 2 , . . . , <x 2/+1 . There are 
(2 1 + l) 2 nuclear wave functions to include in ^ tota , . (In the case of H 2 , / = i, there 
are two spin states a and /?, and there are four nuclear spin functions, three of which are 
symmetric and one of which is antisymmetric.) The antisymmetric nuclear spin func- 
tions are of the form <x f (l)a y (2) — a f (2)a;(l), 1 < i,j < 21 + 1. There are (21 + l)(2/)/2 
such combinations, and so this is the number of antisymmetric nuclear spin functions. 
(For H 2 , we find that there is only one antisymmetric choice in agreement with the 
above paragraph.) All the remaining (21 + l) 2 total nuclear spin functions are sym- 
metric, and so their number is (21 + l) 2 — 1(21 + 1) = (I + 1)(2/ + 1). Thus we can 
write the following summary for states; 

integral spin 

1(21 + 1) antisymmetric nuclear spin functions couple with odd J 
(I + 1)(27 + 1) symmetric nuclear spin functions couple with even J 
half-integral spin 

1(21 + 1) antisymmetric nuclear spin functions couple with even J 
(I + 1)(2J + 1) symmetric nuclear spin functions couple with odd J 
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Figure 6-7. The vibration-rotation spectrum of acetylene. This represents one vibrational line. The 
alternation in the intensity of the lines is due to the statistical weights of the rotational 
levels. (From L. W. Richards, J. Chem. Ed ., 43, p. 645, 1966.) 

These combinations of nuclear and rotational wave functions produce the correct 
symmetry required of the total wave function under interchange of identical nuclei. 
Remember that all of these conclusions are for electronic states, the most com- 
monly occurring ground state. (See Problem 6-26 for a discussion of 0 2 .) 

Even though we have considered only diatomic molecules here, the results of this 
section apply also to linear polyatomic molecules such as C0 2 , H 2 C 2 . For example, 
the molecules HC 12 C 12 H and DC 12 C 12 D have their rotational states weighted in a 
similar way as H 2 and D 2 . Figure 6-7 shows the vibration-rotation spectrum of 
H 2 C 2 . The alternation in the intensity of these rotational lines due to the statistical 
weights is very apparent. 

6-5 THE ROTATIONAL PARTITION FUNCTION OF 
A HOMONUCLEAR DIATOMIC MOLECULE 

The results of the previous section show that for homonuclear diatomic molecules 
with nuclei having integral spin, rotational levels with odd values of J must be coupled 
with the 1(21 + 1) antisymmetric nuclear spin functions, and that rotational levels with 
even values of J must be coupled with the (I 4- 1)(2J + 1) symmetric nuclear spin 
functions. Thus we write 

4,0,. nucCD = (/ + 1X2/ + 1) I (2 J + l)e _e ’ J(J+1)/T 

/even 

+ 1(21 + 1)5] (2 J + l)e -e '" ,(7+1)/T (6-40) 

/ odd 

Likewise, for molecules with nuclei with half-integer spins, we have 
9ro.,„uc(n = K2I + 1) £ (2/ + i) e - Wivr 

J even 

+ (/ + 1X2/ + 1) Y (2J + l)e -0rJ(J+ 1)/T (6-41 ) 

Jodd 

Notice that in this case the combined rotational and nuclear partition function does 
not factor into q TOt q nucl . This is a situation in which we cannot ignore q nucl . For most 
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molecules at ordinary temperatures, © r T, and we can replace the sum by an 
integral. We see then that 


I * I *^fV + lK e ^ (J+1)/r dJ=r^- 

Jeven J odd ^ all J Z *0 ZxJ r 


and so both Eqs. (6-40) and (6-41) become 

(21 + 1 ) 2 T 


*7rot, nucl(^) 


2© r 


(6-42) 


(6-43) 


which can be written as q [ol (T)q nucl where 


q rot (T) = -T and q nucl = (21 + l) 2 (6-44) 

2© r 

For Eq. (6-42) to be valid, &JT must be less than about 0.20. 

This result is to be compared to the result for a heteronuclear diatomic molecule, 
namely, g I0t (T) = T © r . The factor of 2 that appears above in the high-temperature 
limit takes into account that the molecule is homonuclear, and so its rotational parti- 
tion function is given by Eq. (6-40) or (6-41) instead of (6-26). This factor of 2 is 
called the symmetry number and is denoted by a. It legitimately appears only when 
© r is less than approximately 0.2T, since only then can we use Eq. (6-42). Under- 
standing the origin of this fact then, we can write 


9rot (T) » 


&n 2 IkT 

ah 2 


1 

a 


X(2/ + l)e- e ' J < /+1 >' T 

J=0 


e r <T 


(6-45) 


where a = 1 for heteronuclear molecules, and a = 2 for homonuclear diatomic mole- 
cules. Remember that this is applicable only to the high-temperature limit or its Euler- 
MacLaurin correction. A similar factor will appear for polyatomic molecules also. 

There are some interesting systems in which & r s T is not small. Hydrogen is one 
of the most important such cases. Each nucleus in H 2 has nuclear spin and so 

0,ot,„uci = 1 V J + l)c- e ^ +1)/T +3 £ (2J+ l)e“ e ^ (J+1)T (6-46) 

J even / odd 

The hydrogen with only even rotational levels allowed (antisymmetric nuclear spin 
function or “opposite” nuclear spins) is called para-hydrogen; that with only odd 
rotational levels allowed (symmetric nuclear spin function or “parallel” nuclear 
spins) is called ortho-hydrogen. The ratio of the number of ortho-H 2 molecules to the 
number of para-H 2 molecules is 

3 E(2J+ i) e - e ^ +1)/r 

™ ortho J odd 

ZiV+Qe-M+w 

/eve n 

Figure 6-8 shows the percentage of p-H 2 versus temperature in an equilibrium mixture 
of ortho- and para-hydrogen. Note that the system is all para- at 0°K and 25 percent 
para- at high temperatures. 

Figure 6-9 illustrates an interesting situation that occurs with low-temperature heat 
capacity measurements on H 2 . Equation (6-46) can be used to calculate the heat 
capacity of H 2 , and this is plotted in Fig. 6-9, along with the experimental results. It 
can be seen that the two curves are in great disagreement. These calculations and 
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Figure 6-8. The percentage of para-hydrogen in an equilibrium mixture as a function of temperature. 



Figure 6-9. The rotational and nuclear contribution to the molar heat capacity for ortho-hydrogen, 
para-hydrogen, an equilibrium mixture of ortho- and para-hydrogen, a metastable 75 
percent ortho- and 25 percent para- mixture, and the experimental data. (From K. F. 
Bonhoeffer and P. Harteck, Z. Physikal. Chem., 4B, p. 113, 1929.) 
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measurements were made at a time when quantum mechanics was being developed, 
and was not accepted by all scientists. For a while, the disagreement illustrated in 
Fig. 6-9 was a blow to the proponents of the new quantum mechanics. It was Dennison* 
who finally realized that the conversion between ortho- and para-hydrogen is extremely 
slow in the absence of a catalyst, and so when hydrogen is prepared in the laboratory 
at room temperature and then cooled down for the low-temperature heat capacity 
measurements, the room-temperature composition persists instead of the equilibrium 
composition. Thus the experimental data illustrated in Fig. 6-9 are not for an equili- 
brium system of ortho- and para-hydrogen, but for a metastable system whose 
ortho-para composition is that of equilibrium room-temperature hydrogen, namely, 
75 percent ortho- and 25 percent para-. If one calculates the heat capacity of such a 
system, according to 


C v = £C K (ortho-) + iCV(para-) 


where C K (ortho-) is obtained from just the second term of Eq. (6-46), and C v ( para-) is 
obtained from the first term of Eq. (6-46), one obtains excellent agreement with the 
experimental curve. A clever confirmation of this explanation was shortly after 
obtained by Bonhoeffer and Harteck,f who performed heat capacity measurements 
on hydrogen in the presence of activated charcoal, a catalyst for the ortho-para 
conversion. This produces an equilibrium system at each temperature. The experi- 
mental data are in excellent agreement with the equilibrium calculation in Fig. 6-9. 

The explanation of the heat capacity of H 2 was one of the great triumphs of post- 
quantum mechanical statistical mechanics. You should be able to go through a similar 
argument for D 2 , sketching the equilibrium heat capacity, the pure ortho- and para- 
heat capacity, and finally what you should expect the experimental curve to be for D 2 
prepared at room temperature and at some other temperature, say 20°K. (See 
Problem 6-17.) 

In principle, such nuclear spin effects should be observable in other homonuclear 
molecules, but a glance at Table 6-1 shows that the characteristic rotational tem- 
peratures for all the other molecules are so small that these molecules reach the 
“high-temperature limit” while still in the solid state. Hydrogen is somewhat unusual 
in that its rotational constant is so much greater than its boiling point. 

For most cases then, we can use Eq. (6-45) which, when we use the Euler-MacLaurin 
expansion, becomes 


T L @ r 1 /©A 2 4 /0 r \ 3 \ 

9ro,( )- <7® r ( 1 + 3T + 15\T) + 315\77 + J 


(6-47) 


Usually only the first term of this is necessary. Some of the thermodynamic functions 
are 


li-S-sGM <6 ^’ 

c “""*{ i+ s(t) , + -) (6 - 49> 


* D. M. Dennison, Proc . Roy. Soc. A115, 483, 1927. 
t K. F. Bonhoeffer and P. Harteck, Z. Phys. Chem. t 4B, 1 13, 1926. 



108 IDEAL DIATOMIC GAS 


S,„-Nt{l-ln(^)-l(|) ! + -} ,6-50, 

where all of these formulas are valid in the same region, in which o itself is a meaningful 
concept, that is, 0 r < 0.2 T. The terms in 0 r T and its higher powers are usually not 
necessary. Note that Eq. (6-47) is identical to Eq. (6-35) except for the occurrence of 
the symmetry number in Eq. (6-47). 


6-6 THERMODYNAMIC FUNCTIONS 

Having studied each contribution to the total partition function q in Eq. (6-17), we 
can write in the harmonic oscillator-rigid rotor approximation 


q{V, T ) = 


/2nmkT\ 3/2 7 8t z 2 I1cT 

v - * 5 / v ~oir 


e -fifyl2^ _ e -l >h '’)-'( Dei e D ' lkT 


(6-51) 


Remember that this requires that © r < T, that only the ground electronic state is 
important, and that the zero of energy is taken to be the separated states at rest in their 
ground electronic states. Note that only < 7 trans is a function of V, and this is of the form 
f(T)V which, we have seen before, is responsible for the ideal gas equation of state. 
The thermodynamic functions associated with Eq. (6-51) are 


E 5 hv hv/kT D e 

NkT ~2 + 2kT + e hv/kT - 1 “ kT 

C v 5 /hv\ 2 <? vlkT 
Nk~2 + (e hvlkT - l) 2 

4 =h> ry"»iy^ a/a ^ +ln . 

Nk L h 2 \ N 

hv/kT 

+ e hv/kT _ j 


Zn 2 IkTe 

ah 2 

- ln(l - e~ hv/kT ) + In a> el 


pV = NkT 


AT) 

kT 


= —In 


2n(m i + m 2 )k 


T 


3/2 ,_ , %n 2 IkT 

kT n ah 2 + 


hv 

2kT 


+ ln(l - e ~^ kT ) -^-ln <o el 


(6-52) 

(6-53) 


(6-54) 

(6-55) 


(6-56) 


Table 6-1 contains the characteristic rotational temperatures, the characteristic 
vibrational temperatures, and D 0 = D e — \hx for a number of diatomic molecules. 

Table 6-3 presents a comparison of Eq. (6-54) with experimental data. It can be 
seen that the agreement is quite good and is typical of that found for the other thermo- 
dynamic functions. It is possible to improve the agreement considerably by including 
the first corrections to the rigid rotor-harmonic oscillator model. These include centrif- 
ugal distortion effects, anharmonic effects, and other extensions. The consideration 
of these effects introduces a new set of molecular constants, all of which are deter- 
mined spectroscopically and are well tabulated. (See Problem 6-24.) The use of such 
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Table 6-3. The entropies of some diatomic molecules calculated according to Eq. (6-54) compared to 
the experimental values at 1-atm pressure and 25°C* 



S(cak.) 

(e.u.) 

S(exp.) 

(e.u.) 

h 2 

31.1 

31.2 

o 2 

49.0 

49.0 

n 2 

45.7 

45.7 

Cl 2 

53.2 

53.3 

HC1 

44.6 

44.6 

HBr 

47.4 

47.4 

HI 

49.4 

49.3 

CO 

47.2 

46.2 


* The experimental values have been corrected for any nonideal gas behavior. 


additional parameters from spectroscopic data can give calculated values of the entropy 
and heat capacity that are actually more accurate than experimental ones. 

It should be pointed out, however, that extremely accurate calculations can require 
a sophisticated knowledge of molecular spectroscopy. For example, we said above 
that the electronic partition function was similar to that in the atomic case. This, 
however, is not entirely true. For molecules in states other than a £ state (which has 
zero total angular momentum), the total electronic angular momentum must be 
coupled with the overall rotational angular momentum, and this coupling must be 
treated in a detailed quantum mechanical way. This is too specialized to discuss here, 
but the result of this coupling is that the electronic and rotational partition functions 
do not separate. When T > 0 r , however, the molecules are in states with large enough 
rotational quantum numbers [cf. Eq. (6-39)] that the angular momentum coupling is 
no longer important, and the rotational-electronic partition function separates into a 
rotational part and an electronic part. Since we have chosen the zero of energy to be 
the separated electronically unexcited atoms at rest, the electronic partition function is 

q e (T) = w el e DclkT + a) e2 e~ tllkT + • • • 

where the w ej are the degeneracies, and the e/s are measured relative to the ground 
electronic state of the molecule. Keep in mind, however, that for some molecules, such 
as NO, this equation is valid only at high temperatures, and that the low-temperature 
partition function requires a fairly specialized knowledge of the coupling of electronic 
and rotational angular momenta. See Herzberg under “Additional Reading” for a 
thorough discussion of this complication. 

It seems logical at this time to go on to a discussion of polyatomic molecules in 
much the same manner as we have for diatomics. We would see very quickly, however, 
that unless the molecule possesses a certain degree of symmetry, it is impossible to 
write down any closed-form expression for its rotational energy levels. This means that 
a calculation of q rot (T) is at best a complicated numerical problem. This would appear 
to imply that we have come to the end of the line for statistical thermodynamical 
applications, and we have not even begun to consider interactions between molecules! 
Even just two simple molecules, to say nothing of N particles, interacting through any 
kind of realistic interatomic potential becomes an extremely complicated quantum 
mechanical problem. At this point we must go back and reconsider some of the results 
we have derived up to now. 
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PROBLEMS 

6-1. The Morse potential is 
U(r) = D e (l - 


Show that p = v(2t rVW 1/2 . 

6-2. The dissociation energy D 0 of H 2 is 103.2 kcal/mole, and its fundamental vibrational 
frequency w is 4320 cm -1 . From this information, calculate D 0 and w for D 2 , T 2 , and HD, 
assuming the Bom-Oppenheimer approximation. 

6-3. Given that D 0 for H 2 is 103.2 kcal/mole and that ©„ is 6215°K, calculate D 0 for both 
D 2 and T 2 . 

6-4. Show that the moment of inertia of a diatomic molecule is p r e 2 , where p is the reduced 
mass, and r € is the equilibrium separation. 

6-5. Show that the force constants in Table 6-1 are consistent with the frequencies given 
there. 

6-6. Using the data in Table 6-1, calculate the frequencies that are expected to be found 
in the rotational spectrum of HCl. 

6-7. In the far infrared spectrum of HBr, there is a series of lines separated by 16.72 cm -1 . 
Calculate the moment of inertia and intemuclear separation in HBr. 

6-8. Show that the vibrational contribution to the heat capacity C v of a diatomic molecule 
is Nk as T-+ oo. 

6-9. Derive Eq. (6-35) from the Euler-MacLaurin summation formula. 

6-10. Show that the rotational level that is most populated is given by J mzx = (kT/2B) 1/2 . 
Calculate / ma * for C0 2 and H 2 at room temperature. 

6-11. The rotational constant B for HC 12 N 14 is 44,315.97 MHz (megahertz) and DC 12 N 14 
for 36,207.40 MHz. Deduce the moments of inertia for these molecules. Assuming that the 
bond lengths are independent of isotopic substitution, calculate the H-C and C-N bond 
length. 

6-12. Given that the values of © r and ©„ for H 2 are 85.3 K and 621 5°K, respectively, 
calculate these quantities for HD and D 2 . 

6-13. What is the most probable value of the rotational quantum number / of a gas phase 
N 2 molecule at 300°K? What is the most probable vibrational quantum number n for this 
same situation? 
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6-14. Using the Euler-MacLaurin expansion, derive the second- and third-order correc- 
tions to the (first-order) high-temperature limit of E r and CV> . Express your result in terms of 
a power series of © r /r. 

6-15. Calculate the rotational contribution to the entropy of HD at 2(TK, 100 'K, and 
300°K, using the formulas appropriate for each particular temperature, and estimate the 
error involved in each. 

6-16. Discuss the statistical weights of a hypothetical diatomic molecule X 2 with a ground 
electronic state 2«T> supposing the X nuclei have integral spin (bosons) and half-integral 
spin (fermions). Derive the rotational partition function and the rotational contribution to 
the heal capacity CV for each case. 

6-17. Calculate the percent of para-D 2 as a function of temperature (assuming equilib- 
rium) and also calculate the heat capacity of the equilibrium mixture, para-D 2 , ortho-D 2 , 
and finally what you expect would be the experimental heat capacity. 

6-18. Why does one not see discussions in the literature concerning the ortho-para forms 
of fluorine? 

6-19. Show that the thermodynamic quantities p and C v are independent of the choice 
of a zero of energy. 

6-20. In the far infrared spectrum of HC1, there is a series of lines with an almost constant 
spacing of 20.7 cm -1 . In the near infrared spectrum, there is one intense band at 3.46 microns. 
Use these data to calculate the entropy of HC1 at 300°K and 1 atm (assuming ideal behavior). 

6-21. Molecular nitrogen is heated in an electric arc, and it is found spectroscopically that 
the relative populations of excited vibrational levels is 

0 12 3 4 

1.000 0.200 0.040 0.008 0.002 

Is the nitrogen in thermodynamic equilibrium with respect to vibrational energy ? What is the 
vibrational temperature of the gas? Is this necessarily the same as the translational tempera- 
ture? 

6-22. Without looking anything up, put in order of decreasing magnitudes the following 
“temperatures”: 

©„ H2 , © r H2 , © v c ' 2 , © r c ' 2 , ©„ HC1 , © r HC1 

6-23. A more accurate expression for the vibrational energy of a diatomic molecule is 
e„ = (n+ i)hv — x£n + $) 2 hv 

where x e is called the anharmonicity constant. The additional term here represents the first 
deviations from strictly harmonic behavior. Treating x e as a small parameter, calculate the 
anharmonic effect on the various thermodynamic functions at least to first order in x e . 

6-24. The model of a diatomic molecule presented in this chapter is called the rigid rotor- 
harmonic oscillator model. The rotational-vibrational energy in this approximation is 

Evr = (n + i)hv + BJ(J + 1) 

This expression can be improved in a number of ways. The harmonic oscillator approximation 
can be modified to include terms that reflect the deviations from harmonic behavior (an- 
harmonicity) as the vibrational energy of the molecule increases. This is done by quantum 
mechanical perturbation theory, which gives 


n 

f JL 

fo 


e v = (n + i)hv — x e (n + t) 2 hv H 
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where x e is a small constant called the anharmonicity constant. In addition to this, there is a 
correction due to the fact that the molecule is not a rigid rotor and, in fact, stretches some as 
the molecule rotates with greater energy. This is also handled by perturbation theory and 
gives 


B r = BJ(J + 1) - DJ\J + l) 2 + * * * 

where D is a small constant called the centrifugal distortion constant. Lastly, there exists a 
coupling between the rotational and vibrational modes of the molecule, since its moment of 
inertia changes as the molecule vibrates. Putting all this together gives 

E vr — (n 4- i)hv + BJ(J + 1) — x e (n + i) 2 hv — DJ 2 (J + l) 2 — a(n + $J(J + 1) 

where a is the rotation-vibration coupling constant. These terms, which correct the rigid 
rotor-harmonic oscillator approximation, are usually quite small. Using this more rigorous 
expression for e vr , show that the molecular partition function can be written in the form 


q(V, T) — ^rr-ho^fcorr 

where 


4?cor f — 1 


2kT /D 
B 


e fihv -l 


2fihv 

(e fihv - 1 ) 2 


x e 4- higher-order terms in D, a, and x e 


Calculate the effect of q co « on E and C v for 0 2 at 300°K, given the following values of the 
spectroscopic parameters: x e = 0.0076, D = 4.8 x 10“ 6 cm -1 , and a = 0.016 cm -1 . 

6-25. Consider a system of independent diatomic molecules constrained to move in a plane, 
that is, a two-dimensional ideal diatomic gas. How many degrees of freedom does a two- 
dimensional diatomic molecule have? Given that the energy eigenvalues of a two-dimensional 
rigid rotor are 




h 2 J 2 


J = 0 , 1 , 2 ,... 


with a degeneracy a> j = 2 for all / except J 0, calculate the rotational partition function. I is 
the moment of inertia of the molecule. The vibrational partition function is the same as for a 
three-dimensional diatomic gas. Write out 

qiT') — ^transCf^rotlTD^vIbll} 

and derive an expression for the average energy of this two-dimensional ideal diatomic gas. 

6-26. Show that the molecule 0 16 0 16 has only odd rotational levels in its ground 
electronic state. 



CHAPTER 1 


CLASSICAL STATISTICAL 
MECHANICS 


So far we have been able to derive translational, rotational, and vibrational partition 
functions for linear molecules. In each case we saw that if the temperature were high 
enough we could replace sums by integrals and obtain high-temperature limits. These 
prove to be numerically satisfactory for most gases at ordinary temperatures (with the 
exception perhaps of the vibrational case). 

As the temperature increases, the average energy per molecule increases, and so in a 
quantum mechanical sense, the quantum numbers describing this motion ( n x for 
translational, J for rotational, etc.) also increase, meaning that the molecules are in 
the high quantum number limit. For example at room temperature translational 
quantum numbers are typically 10 8 . (See Problem 5-3.) It is the recognition of this 
fact that will point the way to a solution to the problem discussed in the last paragraph 
of Chapter 6. 

It is one of the fundamental principles of quantum mechanics that classical behavior 
is obtained in the limit of large quantum numbers. So we see that up to now our 
procedure has been to solve a particular quantum mechanical problem, use this 
result in the molecular partition function, use a high-temperature approximation, and 
then find that this high-temperature limit is satisfactory. In other words, we were start- 
ing with a quantum mechanical solution and then taking the classical limit at a later 
stage. It is natural to seek a procedure in which we can use classical mechanics through- 
out, and such an approach is developed in this chapter. For simplicity we shall first 
consider only molecular partition functions, although we shall generalize our results 
afterward. 

7-1 THE CLASSICAL PARTITION FUNCTION 

Consider the molecular partition function 

<1 = E 


(7-1) 
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This is of the form of a sum of e ~ P(eneTgy) over all possible quantum states. It is natural 
to assume that the corresponding classical expression is a similar sum, or since the 
energy in the classical sense is a continuous function of the momenta Pj and coordinates 
qj , this sum would become an integral over all the possible classical “ states ” of the 
system. Since the classical energy is the Hamiltonian function H(p y q) y the molecular 
partition function q(V y T) becomes 

tfcfass -/•••/ q) dp dq (7-2) 

In Eq. (7-2) the notation (p, q) denotes all the momenta and coordinates on which H 
depends; dp stands for dp t dp 2 • * * dp s and dq for dq t • • • dq s , where s is the number of 
momenta or coordinates necessary to completely specify the motion or position of 
the molecule. The quantity s represents the number of degrees of freedom of the 
molecule. The set of coordinates {qj} does not necessarily have to be a set of Cartesian 
coordinates, and more usually represents a set of generalized coordinates, that is, any 
set of coordinates that conveniently specifies the position of the molecule. For a mass 
point, for example, the generalized coordinates might be simply x y y y and z; for a rigid 
rotor, we might choose the two angles 6 and <j> needed to specify the orientation of the 
molecule. Usually the choice of generalized coordinates is obvious. The momenta {pj} 
in Eq. (7-2) are the generalized momenta conjugate to the {qj} [cf. Eq. (1-19)]. 

At this stage Eq. (7-2) is just a plausible conjecture. Let us now pursue this idea by 
considering a monatomic ideal gas once again. From Eq. (5-8), we have 

, T , _ /2t zmkT\ 3/2 Tr 

t) = y h2 j v 

The classical Hamiltonian of one atom of a monatomic ideal gas is simply the kinetic 
energy: 

H = ^~(P X 2 + Py 2 + PS) 


According to Eq. (7-2), then, 

tfciass ~ J ' ■ ■ J ex p| - ^ Px + 2m + Pz dPx dPy dPz dy dz (7-3) 


Notice here that since it takes three coordinates to specify the position of a point 
particle, < 7 class is a six-fold integral. The integral over dx dy dz simply yields the volume 
of the container V y and so we have 


(Zciass ~ e~ M2m dpj 3 = (2nmkT) 3,2 V 


(7-4) 


We see that except for a factor of Planck’s constant cubed, this is just the translational 
partition function that we obtained before. Of course, we cannot expect to derive a 
purely classical expression that contains h y and so although our conjecture may be 
incomplete, there seems to be some element of truth to it. 

Let us see how this procedure works for the other partition functions that we have 
evaluated. For the rigid rotor, the Hamiltonian is 




P* 2 \ 
sin 2 6/ 
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where I is the moment of inertia of the molecule. The generalized coordinates and 
momenta in this case are 9, <f>, p e , and p^, and so Eq. (7-2) is 

q tot ~J J dPe d<p J d8e~ m = (Sn 2 IkT) ( 7 - 5 ) 


For the classical harmonic oscillator. 


2 k 


H = ?- + ^x 2 

2p 2 


and 


where 

v 


flvib~ f dp j dxe pH = — 
J -«> J -oo V 

_ J_ //c\ 1/2 
~2n [pj 


( 7 - 6 ) 


( 7 - 7 ) 


We can see from these three examples that the translational partition function is 
incorrect by a factor of ft 3 ; the rotational partition function is incorrect by a factor 
of ft 2 ; and the vibrational partition function is incorrect by a factor of ft. It appears 
that a factor of ft results for each product dp } dqj occurring in q cbiSS . Since partition 
functions are dimensionless, and ft has units of momentum times length, we see that 
this at least automatically satisfies a dimensional requirement. We shall therefore 
assume that 


q = ^e PH U d Pj d <lj < 7 - 8 > 

We now shall extend this assumption to systems of molecules. Equation (4-10) says 
that at high enough temperatures, we can write for a system of N independent in- 
distinguishable particles 



where Hj is the Hamiltonian of the jth molecule and is a function of p jt , . . . , p }s , 
<lji> • • • > <7;s • We now simply relabel the momenta and coordinates such that p t 
through p s represent p u through p ls ; p s+1 through p s+s represent p 21 through p 2s , 
and so on, and write 



1 

Niff* 1 

1 

N\h* N 


f-fe-’Wndp'dq, 

J J i=l 

nr sM 

j’" j e ~ fH Y[dPidqi 


where H is the Hamiltonian of the N-body system. This form suggests the classical 
limit of Q for systems of interacting particles. We conjecture that 


e= 


i 

Nlh sN 


J • • • je dp dq 


(7-9) 
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where H(p, q) is the classical N-body Hamiltonian for interacting particles. The nota- 
tion (p, q) represents the set of p/s and q/s that describes the entire system, and dp dq 
represents 

sN 

n d Pj 

j= i 

We have assumed then that the classical limit of Q(N , V, T ) is given by 


Q = Z e-**' - j/jft J • ’ • { e-W- «> dp dq (7-1 0) 

For a monatomic gas, for example, 

1 N 

H(p> q) = ^ EW + p yj 2 + ft/) + yi, Zj, . . . , x N , y N , z N ) (7-1 1 ) 

Equation (7-10) is, in fact, the correct classical limit of Q , although we have not proved 
it here. It is actually possible to start with the quantum mechanical sum in Eq. (7-10) 
and to derive the integral as the classical limit, that is, the limiting result as h -► 0 
(cf. Section 10-7). 

If we substitute Eq. (7-11) into Eq. (7-10), the momentum integrations can be done 
easily, and we get 

^ 1 / 2nmkT\ 3N/2 ^ 

6class'~jy|^ ^2 J (7-12) 


where 

Z N = j v e~ U{xi ZN)/kT dx v • • • dz N (7-1 3) 

In Eq. (7-13), Z N is called the classical configuration integral . Since the intermolecular 
forces depend upon the relative distances between molecules, this integral is, in general, 
extremely difficult and is essentially responsible for the research in equilibrium statis- 
tical mechanics. In the absence of intermolecular forces, U = 0 and Z N = V N Equa- 
tions (7-12) and (7-13) are fundamental equations in the study of monatomic, classical, 
imperfect gases and liquids. 

It often happens that not all of the degrees of freedom of a molecule can be treated 
classically. For example, we have seen that the spacing between translational and 
rotational levels is small enough that the sum over states or levels can be replaced by 
an integral, that is, these degrees of freedom can be treated classically. This is not the 
case, however, with the vibrational degrees of freedom, and these degrees of freedom 
must be treated quantum mechanically. 

Suppose, then, that the Hamiltonian of a molecule can be written as 

H = // c i a ss + tfquant (7-14) 

where H class refers to the s degrees of freedom that can be treated classically, and 
// quant refers to the degrees of freedom that cannot be treated classically. Then 


(Z (Z class (Zquant 

where 

((class = e - H ‘ l ™< p - q),kT dp 1 dq r - dp„ dq s 


(7-15) 


(7-16) 
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Note that Eq. (6-51) is of the form of Eq. (7-15), where the translational and rotational 
degrees of freedom are treated classically, and the vibrational and electronic degrees 
of freedom are treated quantum mechanically. 

Equations (7-14) to (7-16) are immediately generalizable to a system of interacting 
molecules. If the Hamiltonian of the entire system is separable into a classical part and 
a quantum part, then 

H = Jf clMB + H quant (7-17) 

6 = 6class6 q uam (7-1 8) 

_ Gquant C -H clas JkTj j , 7 _ iq x 

Nlh sN J ^*PcIass ***/cIass 1 


7-2 PHASE SPACE AND THE LIOUVILLE EQUATION 

Until now our approach has been to go to the classical limit only when it was 
necessary. Historically, however, statistical mechanics was originally formulated by 
Boltzmann, Maxwell, and Gibbs in the nineteenth century before the evolution of 
quantum mechanics. Their formulation, therefore, was based on classical mechanics, 
and since this still is a most useful limit, we shall now discuss the classical mechanical 
formulation of statistical mechanics. This formalism forms the basis of most of the 
work involving interacting systems in equilibrium and nonequilibrium statistical 
mechanics that is done today. 

Consider any classical system containing N (interacting) molecules. Let each mole- 
cule have s degrees of freedom, that is, each molecule requires s coordinates to com- 
pletely describe its position. Let the number of coordinates necessary to describe the 
positions of all N molecules be / = sN. The / coordinates, q l9 q 2 , . . . , q t , then com- 
pletely describe the spatial orientation of the entire N-body system. To each of these 
/ coordinates, there corresponds a conjugate momentum pj , say, defined by Eq. (1-19). 
The / spatial coordinates {qj} and the / momenta {pj} completely specify the classical 
mechanical state of the N-body system. These 21 coordinates, along with the equations 
of motion of the system, completely determine the future and past course of the 
system. 

We now construct a conceptual Euclidean space of 2/ dimensions, with 2/-rectanguIar 
axes, one for each of the spatial coordinates q u . . . , q t and one for each of the momenta 
p u Following Gibbs, we speak of such a conceptual space as a phase space 

for the system under consideration. The state of the classical N-body system at any 
time t is completely specified by the location of one point in phase space. Such a point 
is called a phase point . As the system evolves in time, its dynamics is completely 
described by the motion or trajectory of the phase point through phase space. The 
trajectory of the phase point is given by Hamilton's equations of motion: 


• dH A 

Qj = T~ and 

dpj 


Pj = 


dH 

dqj 


j= 1,2 l = sN 


(7-20) 


In principle, these 21 equations can be integrated to give {qj(t)} and For nota- 

tional simplicity, we shall denote the set of / q's by q(t), and the set of / p’s by p(t). The 
21 constants of integration can be fixed by the location of the phase point at some 
initial time, say t 0 . Of course, in practice, such an integration is not feasible. 
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We now introduce the concept of an ensemble of systems in phase space. For 
simplicity we shall consider a microcanonical ensemble, that is, an ensemble repre- 
sentative of an isolated system. Consider a large number of isolated systems, each 
of which having the same values of macroscopic variables N , V, and E. 

The detailed classical state of each system in the ensemble has a representative 
phase point in the same phase space. The entire ensemble then appears as a cloud of 
points in phase space. As time evolves, each point will trace out its independent 
trajectory. The trajectories are independent, since each one represents an isolated 
system and is, therefore, independent of all the others. The postulate of equal a priori 
probabilities requires that there is a representative phase point in phase space for each 
and every set of coordinates and momenta consistent with the few fixed macroscopic 
variables. In particular, the postulate of equal a priori probabilities states that for a 
microcanonical ensemble, the density is uniform over the constant energy “ surface ” 
in phase space, where the value of the energy on the surface is that of the isolated 
system. We consider all parts of phase space equally important, as long as the (p, q) 9 s 
are consistent with all that we know macroscopically about the system, that is, con- 
sistent with the values of N 9 V 9 and E for the system that the ensemble represents. Just 
as every quantum state was equally likely before, now we consider every classical state 
to be equally probable. 

This cloud of points is very dense then, and we can define a number density /(p, q 9 t) 9 
such that the number of systems in the ensemble that have phase points in dp dq 
about the point p, q at time t is /(p, q 9 t) dp dq. Clearly we must have 

f - \f(P,<l)dpdq = st (7-21) 

The ensemble average of any function, say <j>(p , q), of the momenta and coordinates 
of the system is defined as 

$ = ^ J • • • J 4>(P, 0 dp dq (7-22) 

It is Gibbs’ postulate to equate this ensemble average to the corresponding thermo- 
dynamic function. Note the similarity between this equation and Eq. (2-5), its quantum 
mechanical analog. 

Since the equations of motion determine the trajectory of each phase point, they 
must also determine the density /(p, q 9 1) at any time if the dependence of/on p and q 
is known at some initial time t 0 . The time dependence of / is thus controlled by the 
laws of mechanics and is not arbitrary. The time dependence of / is given by the 
Liouville equation, which we now derive. 

Consider the small volume element 5p x • • • 5p t 5q\'” Sq t about the point p u 
Pi* <h> ■ ’ ’ <h • The number of phase points inside this volume at any instant is 

6N =f(p u .... p„qi, ...,q l ,t)5p l ••• 8p t Sq , • • • 8q, 

This number will, in general, change with time since the natural trajectories of phase 
points will take them into and out of this volume element, and the number passing 
through any one “ face ” will, in general, be different from the number passing through 
the opposite “ face.” Let us calculate the number entering one face and leaving through 
the opposite. Consider two faces perpendicular to the tfj-axis and located at q x and 
<h + The number of phase points entering the first of these faces per unit time is 

fii ’ ‘ * Sq t 5pi 5p t 


(7-23) 
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(See Problem 7-33.) The number passing through the other face per unit time is 
+ Pi, •••, Pi) 

x 4i(<h + Sq l9 q 2 , . . . ,q l9 p u . - . , Pi) Sq 2 ‘ * * 5q t dp x -" Sp t 
which, if we expand / and q Y to linear terms in 5q l9 gives 


(/+ Sq * *** &q t s Pi *'* s Pi 


(7-24) 


Subtracting (7-24) from (7-23), we get the net flow of phase points in the ^-direction 
into the volume element dq x • * * dq^py * * * 8p t : 


net flow = - q t s Pi ' ‘ ' s Pt s <h ' 


in the ^-direction. In a similar manner, the net flow in the ^-direction is (remember 
that momenta and spatial coordinates have equal status in phase space) : 

~(wi pi+f W Spi "' Sp,S9i '" Sq ‘ 

Thus the change in the number of phase points through all the faces is 

{wi h +f *, + Wj p ‘ +f W Sp ' " " ' Sp ' 5,1 " " ' 

This must be equal to the change of SN with time, and so we have 
d(5N ) 




This result can be immediately simplified. Since 


. dH 

dpj 


• _ dH 

Pj d<Ij 


(7-25) 


(7-26) 


the first term in parentheses in Eq. (7-25) is 

+ 0 

spj 


(7-27) 


Furthermore, we divide Eq. (7-25) by the volume element 8p t • • • 8p t 5q 1 8q , . This 

gives the rate of change in the density itself around the point pi, ... p h q x , . . . q u so 
that we can write 


d l 

dt 




(7-28) 


where we have written dfldt to indicate that we have fixed our attention on a given 
stationary point in the phase space. 

Equation (7-28) can be written in a more conventional form by using Eqs. (7-26) 
for <7j and pj . The result is 


dt ~ y= i \dpj dqj dqj dpj 


(7-29) 
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This is the Liouville equation, the most fundamental equation of classical statistical 
mechanics. In fact, it can be shown that the Liouville equation is equivalent to the 6 N 
Hamiltonian equations of motion of the N-body system. See Mazo under “Additional 
Reading” for a proof of this. In Cartesian coordinates, the Liouville equation for 
N point masses is (see Problem 7-1 1) : 


Qf N D N 

37 + I “T ■ V '/ + T. F J ' V p/ = 0 


i = 1 ntj 


(7-30) 


In this equation V rj denotes the gradient with respect to the spatial variables in /; 
V PJ denotes the gradient with respect to the momentum variables in /; and F j is the 
total force on the jth particle. The Liouville equation forms the starting point of most 
theories of nonequilibrium statistical mechanics. 

There are several interesting deductions from the Liouville equation which we now 
discuss. Consider Eq. (7-28) 



Since /= /(p , q , 0* this equation is equivalent to 



(7-31) 


(7-32) 


Physically, this equation says that the density in the neighborhood of any selected 
moving phase point is a constant along the trajectory of that phase point. Thus the 
cloud of phase points behaves as an incompressable fluid. Gibbs called this the 
principle of the conservation of density in phase. An equivalent statement of this is 
that if p, q are the coordinates of a phase point at time I, which at time t 0 were (p 0 , q 0 ), 
then Liouville’s equation implies that (see Problem 7-12) 

f(p,q;t)=f(p 0 ,q 0 ;t 0 ) (7-33) 

Because of the equations of motions, the point (p, q) should be considered a function 
of the initial point (p 0 * <7o) and the elapsed time t. That is 

p = p(Po,qoi 0 
q = q(Po,qo;t ) 

Now let us select a small element of volume at (p 0 , q 0 ) at time t 0 . At a later time, 
t 0 4- t 9 the phase points originally on the surface of this volume element will have 
formed a new surface enclosing a volume element of different shape at the phase point 
(p, 4)' The volume element at (p, q) must contain the same number of phase points 
as the original volume element at (p 0 , q 0 ). This follows because a phase point outside 
or inside the volume element can never cross the surface as the element moves through 
phase space, for otherwise there would be two different trajectories through the same 
point in phase space. This is impossible, however, because of the uniqueness of the 
equations of motion of a phase point. Trajectories of phase points can never cross. 
Now since the density and number of phase points in the volume element are the same 
at Po y tfo a ^d p, q 9 it follows that although the shape of this volume may change and 
contort itself as it moves through phase space, its volume remains constant. Gibbs 
called this result conservation of extension in phase space . This fact is expressed mathe- 
matically by writing 

dp bq = bp 0 bq 0 for all t 


(7-34) 
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Another way of expressing this is to say that the Jacobian of the set (p, q) to (p 0 , q 0 ) 
is unity. This can be proved directly from the equations of motion of the system. See 
Mazo under “Additional Reading.” 

A corollary of this theorem, whose proof demands a more extensive knowledge of 
classical mechanics, is that if we are given two sets of coordinates and their conjugate 
momenta, say. 


#2 > • • • > #3 n 9 Pit Pl> Pzn 

Qu Ql y * * • » C?3n> * ■ • > ^3 n 

which can describe a system in phase space equally well, then 

<ki dq% ' ‘ ’ dq 3 „ dpi--- dp 3 „ = dQi • • • dQ 3n dP 1 • dP 3n 

For example, a single particle in three dimensions may be described by the coordinates 
(x, y, z) or the spherical coordinates (r, 0, <f>). It is straightforward, albeit lengthy to 
show that 

dp x dp y dp z dx dy dz = dp r dp e dp # dr d6 d<f> (7-35) 

Notice that although the volume elements in ordinary coordinate space are dx dy dz 
and r 2 sin 0 dr d6 d<f> , the r 2 sin 0 factor does not occur in the phase space transforma- 
tion. These simple volume element transformations would not generally be true if we 
have chosen the generalized coordinates and velocities instead of momenta. This is 
one reason why momenta and not velocities are used to describe classical systems. 


7-3 EQUIPARTITION OF ENERGY 


We have seen that classical statistical mechanics is applicable when the temperature 
is high enough to replace the quantum statistical summation by an integral. Under 
these conditions, it is not necessary to know the eigenvalues of the quantum mechan- 
ical problem, only the classical Hamiltonian is required. There is an interesting theorem 
of classical statistical mechanics which can be used to understand more fully some of 
the results of the last two chapters. 

Consider the expression for the average energy of a molecule in a system of inde- 
pendent molecules. 


JJ He dp t • • • dq s 

JJ e ~ fH d Pi ' d 4s 


(7-36) 


which can be evaluated in principle for any known dependence of H on the p’s and 
the #’s. Multiplying by the total number of molecules gives an expression for the total 
energy of the system, and by differentiating with respect to T, we obtain an expression 
for its heat capacity at constant volume. 

If it so happens that the Hamiltonian is of the form 


m n 

fi(Pi,P2 <Z S )= Z a jPj 2 + Z M/ + H (Pm+l>--->P S >9n+I,--->g s ) (7-37) 

j = 1 J=1 

where the dj and bj are constants, then it is easy to show that each of these quadratic 
terms will contribute kT\2 to the energy and kj2 to the heat capacity. (See Problem 
7-29.) This result is called the principle of equipartition of energy. It should be 
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emphasized that the principle is a consequence of the quadratic form of terms in the 
Hamiltonian, rather than a general consequence of classical statistical mechanics. 

Let us apply this general theorem to some of the cases we have treated in Chapters 
5 and 6. For instance, for a monatomic ideal gas, the Hamiltonian is 


H = 


Px + Pj + Pz 
2m 


(7-38) 


Since there are three quadratic terms, each atom contributes 3 kT f 2 to the total 
energy and so 3k , 1 2 to the constant volume heat capacity. This is exactly our result in 
Chapter 5. For the case of a rigid rotor, the Hamiltonian is 


" 4 ( 


Pe 2 + 


P * \ 
sin 2 ej 


The sin 2 6 in the p# 2 term would seem to exclude the p# 2 term from the principle of 
equipartition, since Eq. (7-37) requires that the coefficients a s be constants. There is a 
more general version, however, that allows the a s and b } to be functions of the momenta 
and coordinates not involved in- the quadratic terms, that is, to be functions of 
Pm+u • • • , p 5 and q„+ 1 , . . . , q s in Eq. (7-37). The proof of this is more difficult than the 
proof of the simpler version. (See either Problem 7-30 or Tolman under “Additional 
Reading.”) Because of this, each quadratic term above still contributes its equipartition 
value, and so the rotational contribution of a rigid rotor to the energy is kT per 
molecule, just as we obtained in Chapter 6 [cf Eq. (6-36)]. 

Note that equipartition is a classical concept, that is, the degree of freedom con- 
tributing must be such that AejkT is small in passing from one level to another. We 
have seen that this is true for translational and rotational degrees of freedom at 
ordinary temperatures, but not vibrational degrees of freedom. The heat capacity for 
an ideal diatomic gas in the rigid rotor-harmonic oscillator approximation is 
[cf. Eq. (6-53)] 


C v = \Nk + 


Nk(QJT) 2 e Gv/T 
(e* v/T - l) 2 


(7-39) 


where the \Nk comes from the translational plus rotational degrees of freedom which, 
we have seen, are excited enough to be treated classically. The second term is the 
vibrational contribution, which reaches its expected classical limit of Nk , since 
the classical Hamiltonian for a harmonic oscillator is (p 2 >2m) + (k2)x 2 ), when QJT 
becomes small, which is far above room temperature for most molecules. A value of 
the vibrational contribution to C v differing from Nk is thus a quantum mechanical 
result. 

There are more general formulations of the principle of equipartition of energy 
than we have given here, but they are not necessary for most purposes. In fact, the 
principle itself is perhaps more of historical interest today than actual practical interest. 
It is interesting to note in this regard that when the electronic structure of atoms and 
metals evolved toward the end of the nineteenth century, it was of great concern to 
Gibbs that the electrons contributed only a very small fraction of their equipartition 
value to the heat capacities of metals. He did not live to see this anomalous result 
completely explained by quantum statistics. Since electrons have such a small mass, 
they behave not at all classically and should, therefore, not be governed by the equi- 
partition of energy (cf Section 10-2). 
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We have made this long detour through phase space for more than just historical 
reasons. As we said earlier, most of the systems of interest to chemists can be treated 
very satisfactorily by classical methods. In fact, the quantum statistical theories of 
systems of interacting particles are quite a demanding and specialized subject whose 
techniques are still being developed. Fortunately, being chemists, we are spared from 
having to master these techniques. Even today the classical Liouville equation forms 
the starting point for most of the rigorous approaches to nonequilibrium statistical 
mechanics. We shall now discuss the problem that sent us here in the first place, namely, 
the study of ideal polyatomic gases. 
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PROBLEMS 

7-1. Show that at room temperature the translational quantum numbers are typically 
around 10 8 or so. 

7-2. What is the constant energy surface in phase space for a simple linear harmonic 
oscillator? What is it for a single-point mass? What is it for an ideal gas of Appoint masses? 

7-3. Convince yourself that trajectories in phase space can never cross, also that surfaces 
(really hypersurfaces) of constant energy can never intersect if the energies are different. 

7-4. Consider a classical ideal gas enclosed in an infinitely tall cylinder in a gravitational 
field. Assuming that the temperature is uniform up the cylinder, derive the barometric 
formula 

( — mgz 

From this calculate the atmospheric pressure at the top of Mt. Everest. 

7-5. An ideal gas consisting of N particles of mass m is enclosed in an infinitely tall 
cylindrical container placed in a uniform gravitational field, and is in thermal equilibrium. 
Calculate the classical partition function, Helmholtz free energy, mean energy, and heat 
capacity of this system. 

7-6. Consider a perfect gas of molecules with permanent electric dipole moments in an 
electric field 8. Neglecting the polarizability of the molecules, the potential energy is 

U — cos 6 

where 6 is the angle between ji and 8 . Using classical mechanics, derive an expression for the 
additional effect of 8 on the energy E and heat capacity of the gas. 
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7-7. The potential energy of N molecules in a container V can often be fairly well ap- 
proximated by a sum of pair-wise potentials: 

U (?1 In) 1 u(r,,rj) 

KJ 

In addition, the pair-wise potentials u(r ty tj ) are often assumed to depend only upon the 
distance r tJ = | r f — r,| between the two molecules. Thus one often writes 

U(r In) 2 u(r,j) 

KJ 

Convince yourself that even these two simplifications of U do not help in trying to evaluate 
the configuration integral Z. 

7-8. It is possible to determine the value of Boltzmann’s constant by observing the distribu- 
tion of suspended Brownian particles in a gravitational field as a function of their height z. 
Given that the particles have a mass of 1.0 x 10" 14 g, that the temperature is 300°K, and the 
following data: 


z(cm) 

Number of particles 


100 


55 

0.0050 

31 

0.0075 

17 

0.0100 

9 


calculate the value of the Boltzmann constant. 

7-9. We can calculate the microcanonical ensemble partition function for a classical 
monatomic ideal gas in the following way. This partition function is given by 

£X£,A£) = ^ 5S J'-" J dp i dp 2 dq 3N 

where the asterisk indicates that one integrates over the region of phase space such that 

1 3 N 

E-*E<— 2 PJ 2 <E 

2m j= i ^ 

We have seen in the quantum mechanical case that the thermodynamic consequences of this 
equation are remarkably insensitive to the value of A E. (See Problem 3-14.) We can find H 
most readily by first evaluating 

I{E) 3N J* J* dpi dp-i dq$n 

where now the asterisk signifies the constraint 

1 3 N 

Note that Q(E, A E) is given by 1(E) — I(E— A E). The integration of dqi--- dq 3N in 1(E) 
immediately gives V", and the remaining integration over the momenta is just the volume of 
a 3AT-dimensional sphere of radius (2 mE) 112 . The volume of a 3 TV-dimensional sphere of 
radius R is (see Problem 1-24) 

t r 3 *' 2 

R 3N 

(37V/2)! 
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(Note that this reduces correctly when 37V = 2 and 3.) Using this formula then, show that 

7 T 3N * 2 V N (2mE) 3N ' 2 
/(£)= Nlh 3K (3N l 2)l 

is in agreement with Eq. (1-36). 

7-10. In Problem 3-14 we showed that the entropy could be calculated from k In Q(E)&E 
or k In $(£), where Q(E)&E is the number of states with energies between E and E 4 A E, and 
®(E) is the total number of states with energies less than E. In addition to this, we showed that 
the result is remarkably insensitive to the choice of A E. We shall now discuss the classical 
analog of this. In particular, this problem involves showing that the volume of an TV-dimen- 
sional sphere is essentially the same as the volume of the hypershell of thickness s. First write 
the volume of the hypersphere as 

FspheretR) = COnSt X R" 

Now show that if TV is large enough such that sN > R, then 

V sh 'n=V(R)~V(R^s) 

= const X R N ( 1 —e~ sN,R ) 

/v 1/ 

^ r sphere 

7-11. Show that in Cartesian coordinates, the Liouville equation takes the form of Eq. 
(7-30). 

7-12. Convince yourself that a corollary of Liouville’s equation is 
f(jP,<Et)=f(Po,qo;t 0 ) 

Although we did not discuss it explicitly, much of the kinetic theory of gases is contained in 
this chapter. Problems 7-13 through 7-25 develop some of the kinetic theory of gases. 

7-13. Consider a system of TV interacting molecules, whose vibrational degrees of freedom 
are treated quantum mechanically and whose translational and rotational degrees of freedom 
are treated classically with Hamiltonian 

TTclas* “ Atrans “I” ^rot 4 U 


where K represents kinetic energy, and U represents potential energy. Substitute this into 
Eq. (7-19); integrate overall the coordinates except the 37V translational momentum 
coordinates; and derive 


prob{JV lra „,} = 


* tfatls *r dpuzM 
JV*.™ 'l*T dpum 


Now realize that 


N 1 

TVtrans — ^2 ^ + P*J*) 

and derive the normalized Maxwell-Boltzmann distribution, namely, 

f(Px , p y , Pz) dp x dp y dp z = (lirmkT) ~ 3/2 e~ tp * 2 + p y 2 + p * 2)/2mkr dp x dp y dp* (7-40) 

One can derive all of the usual expressions of the kinetic theory of gases from this. 

7-14. An integral that appears often in statistical mechanics and particularly in the kinetic 
theory of gases is 


/„ = 


i; 


x" e~ ax2 dx 
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This integral can be readily generated from two basic integrals. For even values of «, we first 
consider 



The standard trick to evaluate this integral is to square it, and then transform the variables 
into polar coordinates: 

/ 0 2 = f°° f e-** 2 e-” 2 dx dy 
Jo Jo 

<*oo <* n / 2 

= I I e~ ar2 rdrd6 
Jo Jo 


7 T 

4 a 




1/2 


Using this result, show that for even n 
1-3 *5 -(w-^ 


/„ = 




n even 


2(2fl)" /2 

For odd values of n, the basic integral h is easy. Using h 9 show that 



2 a in + l)/2 


n odd 


7-15. Convert Eq. (7-40) (see Problem 7-13) from a Cartesian coordinate to a spherical 
coordinate representation by writing 

P 2 =Px 2 +p> 2 -\-Pz 2 

p z =p cos 6 
Px=P sin 6 cos <f> 

Py—P sin 6 sin <f> 

dp x dp y dp z -* p 2 sin 6 dp d6 d<f> 

and integrating over 6 and <f> to get 

f(p ) dp = ATT^hrmkT) ~ 3,2 p 2 e -* 2i2mkT dp 

for the fraction of molecules with momentum between p and p + dp. By substituting p = mv, 
we get the fraction of molecules with speeds between v and v + dv: 

( m \ 3/2 

ItHcT ) v 2 e~ mv2,2kT dv 

7-16. Prove that the most probable molecular speed is v mp = (IkT/m) 112 , that the mean 
speed is <y> = (8kT/7rmy ,2 f and that the root-mean-square speed is <i> 2 > 1/2 = (SkT/m) 112 . 
Evaluate these for H 2 and N 2 at 25°C. 

7-17. Show that the mean-square fluctuation of the velocity of the Maxwell-Boltzmann 
distribution is 
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7-18. Show that the average velocity in any direction (say x, y, or z) vanishes. What does 
this mean? 

7-19. Derive an expression for the fraction of molecules with translational energy between 
e and e + de from both Eqs. (7-40) and (5-7). 

7-20. According to Problem 1-35, the speed of sound in an ideal gas is given by 



where M is the molecular weight of the gas, and y = C p /C v . Show that c 0 = 0.81y for an 
ideal monatomic gas. 

7-21. Calculate the probability that two molecules will have a total kinetic energy between 
£ and £ + d£. 

7-22. Calculate the fraction of molecules with x-component of velocity between 
±n(ZkT/my ,2 9 where w = 1,2, and 3. Remember that the integral of exp(— x 2 ) with finite 
limits cannot be evaluated in closed form and is expressed in terms of the error function 
erf (x) by 

7-23. What is the average kinetic energy e and the most probable kinetic energy e mP of a 
gas molecule? 

7-24. Show that the number of molecules striking a unit area per unit time is pv/ 4, where 
p = N/V. 

7-25. How would you interpret the velocity distribution 

<tty) = ex pj~ ^ ~~ + ^ - + ^ ~ c ^} 

in which a, b, and c are constants? 

7-26. The relativistic dependence of the kinetic energy on momentum is 


e = c(p x 2 + p y 2 + p z 2 + m 0 2 c 2 y 12 


where m 0 is the rest mass of the particle, and c is the speed of light. Determine the thermo- 
dynamic properties of an ideal gas in the extreme relativistic limit, where pt> m 0 c. 

7-27. If an atom is radiating light of wavelength A 0 , the wavelength measured by an 
observer will be 


A = A °( 1 + c) 

if moving away from or toward the observer with velocity v z . In this equation c is the speed 
of light This is known as the Doppler effect. If one observes the radiation emitted from a gas 
at temperature T, it is found that the line at A 0 will be spread out by the Maxwellian distribu- 
tion of velocities v x of the molecules emitting the radiation. Show that /(A) d\ the intensity 
of radiation observed between wavelengths A and A + dX, is 


/(A) cc exp 


( mc \ A — A 0 ) 2 1 
( 2A 0 2 kT } 


This spreading about the line at A 0 is known as Doppler broadening. Estimate the Doppler 
line width for HCI radiating microwave radiation at room temperature. 

7-28. Plot C v in Eq. (7-39) versus temperature and see that the vibrational contribution 
does not contribute until the temperature approaches © v . 

7-29. Prove that if the Hamiltonian is given by Eq. (7-37), then each of the quadratic 
terms will contribute kT/2 to the average molecular energy and kjl to the molecular heat 
capacity. 
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7-30. Prove that even if the aj and bj in the Hamiltonian of Eq. (7-37) are functions of 
the momenta and coordinates not involved in the quadratic terms, the law of equipartition 
still applies. In particular, show how this more general version of the law of equipartition 
applies to the rigid rotor Hamiltonian. 

7-31. Let H(p, q) be the classical Hamiltonian for a classical system of TV interacting 
particles. Let Xj be one of the 37V momentum components or one of the 37V spatial coordinates. 
Prove the generalized equipartition theorem, namely, that 



and from this derive the principle of equipartition of energy that we discussed earlier. 
Hint: Realize that the potential U oo at the walls of the container. 

7-32. Consider a two-dimensional harmonic oscillator with Hamiltonian 


H =^(p* 2 +Py 2 ) + ^(x 2 + y 2 ) 


According to the principle of equipartition of energy, the average energy will be 2kT. Now 
transform this Hamiltonian to plane polar coordinates to get 





k 

+ r 


2 


What would you predict for the average energy now? Show by direct integration in plane 
polar coordinates that e = 2kT. Is anything wrong here? Why not? 

7-33. Convince yourself that the number of phase points passing through a face perpen- 
dicular to #1 per unit time is 


fqi &qi &q3 * * * &Pi 
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IDEAL POLYATOMIC GAS 


In this chapter we make a direct extension to polyatomic molecules of the methods 
and approximations used in the previous chapters for diatomic molecules and classical 
statistical mechanics. The introduction of the concept of normal coordinates will 
allow us to treat the vibrational problem of polyatomic molecules as a simple extension 
of the vibration of diatomic molecules (Section 8-1). We shall see, however, that the 
rotational problem must be treated by the method of classical statistical mechanics 
since, for most polyatomic molecules, the quantum mechanical rotational energy 
levels cannot be written in any convenient closed form, and most temperatures are 
such that classical statistical mechanics is applicable anyway (Section 8-2). One new 
feature that arises with polyatomic molecules is hindered internal rotation (in mole- 
cules such as ethane). This type of consideration is important in organic chemistry, 
where the results can be directly applied to a determination of the various conformers 
that exist in certain molecules. Hindered rotation is treated in Section 8-4. 

The discussion in Section 6-1 for diatomic molecules applies equally well to poly- 
atomic molecules. After making the Born-Oppenheimer approximation, we transform 
the Schrodinger equation that describes the motion of the nuclei in the Born- 
Oppenheimer potential into center of mass and relative coordinates. This allows us to 
write 


4^ 4^ I 4^ 

JC —JC trans + JC int 
® ^trans 4" ^inl 


Q ^trans^int 

where 


'2tiMA:T] 3/2 ¥/ 

h 2 


9 Ira ns 


(8-1) 
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and M is the total mass of the molecule. As before, the density of translational energy 
states alone is sufficient to guarantee that the number of energy states available to any 
molecule is much greater than the number of molecules in the system, and so 

Q(N, V, T) = g,f - " s ^ in - ‘ 


The Born-Oppenheimer potential, that is, the potential set up by the electrons moving 
in the field of the fixed nuclei, depends upon a number of internuclear distances. Each 
of the n atoms in a polyatomic molecule requires three coordinates to locate it. Thus it 
takes 3 n coordinates to specify the polyatomic molecule itself. Of these 3 n coordinates, 
3 are needed to specify the center of mass of the molecule. In addition, it requires 
two coordinates (i 9 and <j>) to specify its orientation if it is linear, and three coordinates 
if it is nonlinear. The remaining 3n — 5 or 3/z — 6 coordinates are internal coordinates 
that are required to specify the relative location of the n nuclei. Thus the Born- 
Oppenheimer potential of a polyatomic molecule depends upon 3n — 5 (linear) or 
3 n — 6 (nonlinear) coordinates. Since these coordinates specify the relative locations 
of the n nuclei, they are referred to as vibrational degrees of freedom. Similarly, since 
the two (linear) or three (nonlinear) orientation angles are used to specify the orienta- 
tion of the molecule about its center of mass, they represent rotational degrees of 
freedom. The three coordinates used to locate the center of mass of the molecule are 
translational degrees of freedom. 

As in the case of diatomic molecules, we use a rigid rotor-harmonic oscillator 
approximation. This allows us to separate the rotational motion from the vibrational 
motion of the molecule, and we can treat each one separately. Both problems are some- 
what more complicated for polyatomic molecules than for diatomic molecules. Never- 
theless, we can write the polyatomic analog of Eqs. (6-14) to (6-17): 


Q(N,V,T) = 


((Ztrans 9 rot Q\ib Qclcc ?nucl) 


N\ 


(8-2) 


We choose as the zero of energy all n atoms completely separated in their ground 
electronic states. Thus the energy of the ground electronic state is — D e . As before 
then, the electronic partition function is 

Select = “el e DJkT + ••’ (8-3) 

and we set q nucl = 1 . 

To calculate Q(N , V , T ) then, we must investigate # vib and q rot , and this is done in 
the next two sections. 


8-1 THE VIBRATIONAL PARTITION FUNCTION 

For a polyatomic molecule, the potential in which the nuclei vibrate is a function of 
3n — 5 or 3n — 6 relative coordinates. This potential is a generalization of Fig. 6-1 to 
3n — 5 or 3n — 6 dimensions, and thus is a complicated energy surface. As in the case 
of diatomic molecules, however, the amplitude of the nuclear vibrations is very small, 
and we can expand the potential function about its stable or equilibrium configuration. 
Let the Cartesian coordinates of each nucleus in the molecule be x u y u z l9 x 2 , . . . , 
x n , y n > z n- F° r small vibrations about the equilibrium configuration, we have a 
generalization of Eq. (6-3) to 3n — 5 or 3n — 6 relative coordinates, and the potential 
energy will be a quadratic function of 3n — 5 or 3n — 6 relative coordinates such as 
x 2 — x l9 x 3 — x lt y 2 — yi, and so on. When terms such as (x 2 — Xj) 2 are multiplied 
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out, there results cross terms of the form x x x 2 , XjX 3 , y x y 29 and so on. The presence 
of such cross terms makes the potential energy and hence the Hamiltonian a com- 
plicated mixture of x l9 x 2 , By the introduction of certain linear combinations of 
the Cartesian coordinates x lv x 2 , . . . , it is possible to eliminate the cross terms in the 

potential and to write it as a sum of squares of these new coordinates, say Q u Q 2 , 

In other words, it is possible to transform a Hamiltonian which contains cross terms 
in terms of x u x 2 , . . . , y l9 y 2 , . . . , to a Hamiltonian, which when written in terms of 
Qi, Qi, . becomes 


#e= - 


a 


I 


h 2 d 2 - 
dQ 2 + M 



(8-4) 


In this equation a is the number of vibrational degrees of freedom, that is, 3n — 5 
for a linear molecule and 3n — 6 for a nonlinear molecule, and pj and k } are effective 
reduced masses and force constants. 

Equation (8-4) is the Hamiltonian of a sum of independent harmonic oscillators, and 
so the total energy is of the form 


e 

where 


v ; 


I 




(nj + i)hvj 


1 

271 



nj = 0 , 1 , 2 , 


(8-5) 


There are straightforward methods to determine the Qf s that allow the Hamiltonian 
to be written as a sum of independent terms as in Eq. (8-4). The Qfs are called normal 
coordinates, and their determination for any particular molecule is called a normal 
coordinate analysis. The a fundamental frequencies Vj are obtained automatically in 
a normal coordinate analysis, but in practice they are usually determined spectro- 
scopically. The normal coordinates for a linear triatomic molecule (C0 2 ) are as shown 
in the following figure. 


<e 



* 


0 ^© — ® — » 


(a) 


(b) 




© — ® 


The mode labeled (a) is a symmetric stretch; the one labeled (b) is an asymmetric 
stretch; and the one labeled (c) is a bending mode. This mode is doubly degenerate, 
with one of the modes being in the plane of the page and the other being perpendicular 
to the page. 

For a nonlinear triatomic molecule such as water, we have the following three modes. 
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Because of Eq. (8-4), each normal mode of vibration makes an independent contribu- 
tion to thermodynamic functions such as E, C v , S, and so on. If the normal frequencies 
Vj, v 2 , . . . , v a , where a = 3n — 5 or 3n — 6, are known, we have immediately [see 
Eqs. (6-22) and (6-23)] equations such as 


e -B VJ IZT 


9vib ,P, (1 - <T e “' /r ) 

(8-6) 


(8-7) 

[fr) <i-V— y] 

(8-8) 


where 


© . 

^VJ 


hVj 


k 


Table 8-1 contains values of S vj for a number of molecules. 


Table 8-1 . Values of the characteristic rotational temperatures, the characteristic vibrational tempera- 
tures, and Do for polyatomic molecules* 


molecule 

©ro.(K) 



©v.b(K) 

jD 0 (kcal/mole) 

co 2 


0.561 


3360, 954(2), 1890 

381.5 

h 2 o 

40.1 

20.9 

13.4 

5360, 5160, 2290 

219.3 

nh 3 

13.6 

13.6 

8.92 

4800, 1360, 4880(2), 2330(2) 

276.8 

cio 2 

2.50 

0.478 

0.400 

1360, 640, 1600 

90.4 

so 2 

2.92 

0.495 

0.422 

1660, 750, 1960 

254.0 

n 2 o 


0.603 


3200, 850(2), 1840 

263.8 

no 2 

11.5 

0.624 

0.590 

1900, 1980, 2330 

221.8 

ch 4 

7.54 

7.54 

7.54 

4170, 2180(2), 4320(3), 1870(3) 

392.1 

ch 3 ci 

7.32 

0.637 

0.637 

4270, 1950, 1050, 4380(2), 

2140(2), 1460(2) 

370.7 

CC1 4 

0.0823 

0.0823 

0.0823 

660, 310(2), 1120(3), 450(3) 

308.8 


* These parameters were obtained from a variety of sources and do not necessarily represent the most 
accurate values since they are obtained under the rigid rotor-harmonic oscillator approximation. 
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8-2 THE ROTATIONAL PARTITION FUNCTION 


The rotational properties of a polyatomic molecule depend upon the general shape 
of the molecule. If the molecule is linear, such as C0 2 and C 2 H 2 , the problem is 
exactly the same as for a diatomic molecule. The energy levels are given by 


J(J + 1 )h 2 
Ej ~ Sn 2 I 
(Oj = 2J 4- 1 


J = 0,1,2, 


where, in this case, the moment of inertia I is 


(8-9) 


n 

I =Y m J d / ( 8 - 10 ) 

j= 1 

where dj is the distance of the yth nucleus from the center of mass of the molecule. 
Recall that the coordinates of the center of mass of a molecule are given by 


x - = h%r >x ‘ 

y - = h%r‘ y > 


l " 




( 8 - 11 ) 


where x - , yj , and z } are the Cartesian coordinates of the jth nucleus in an arbitrary 
coordinate system, and M = m 1 + m 2 + • • • + m n . (See Problem 8-1.) 

The rotational partition function of a linear polyatomic molecule is 


%7i 2 IkT T 


( 8 - 12 ) 


As before, we have introduced a symmetry number, which is unity for unsymmetrical 
molecules such as N 2 0 and COS and equal to two for symmetrical molecules such as 
C0 2 and C 2 H 2 . The symmetry number is the number of different ways the molecule 
can be rotated into a configuration indistinguishable from the original. Classically, it 
is a factor introduced to avoid over counting indistinguishable configurations in 
phase space. Table 8-1 gives © r for several linear polyatomic molecules. 

The energy calculated from Eq. (8-12) is kT y in accord with equipartition of energy, 
since there are two degrees of rotational freedom of a linear molecule. 

The moment of inertia is a fundamental property of rigid bodies. The rotational 
properties of a rigid body are characterized by the principal moments of the body, 
which are defined in the following way. Choose any set of Cartesian axes with origin 
at the center of mass of the body. The moments of inertia about these three axes are 

n 

h* = I m A(yj - ycm ) 2 + (zj - z cm ) 2 ] 

j= 1 

n 

lyy = ^ “ 1 ” fey ] 

J= 1 

n 

I zz — X! ^cw) “1" (j^j ycm) ] 

j= 1 
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In addition to these, there are also products of inertia, such as 

n 

lyy ~ ^ Jem) 

j= 1 

Now there is a theorem of rigid body motion that says that there always exists a 
particular set of Cartesian coordinates X , Y, Z, called the principal axes, passing 
through the center of mass of the body such that all the products of inertia vanish. 
The moments of inertia about these axes I xx , Jyy> and / zz are called the principal 
moments of inertia. The principal moments of inertia are customarily denoted by 

an ^ ^C' 

If the molecule possesses any degree of symmetry, the principal axes are simple 
to find. For example, if the molecule is planar, one of the principal axes will be 
perpendicular to the plane. Usually an axis of symmetry of the molecule will be a 
principal axis. The C-H bond of CHC1 3 is a three-fold axis of symmetry and also a 
principal axis. In general, however, it is not often necessary to calculate the principal 
moments of inertia, since there are extensive tables in the literature. They are usually 
given in terms of rotational constants in units of cm" 1 , defined by 


h 

inI A c 


h 

%nI B c 


C = 


h 

$7il c e 


( 8 - 1 3 ) 


from which it is an easy matter to calculate the corresponding rotational temperatures 
© x , © B , and © c . In these quantities, c is the speed of light. Table 8-1 contains values 
of €> a , © B , and © c for a number of polyatomic molecules. 

The relative magnitudes of the three principal moments of inertia are used to 
characterize the rigid body. If all three are equal, the body is called a spherical top; 
if only two are equal, the body is called a symmetric top; and if all three are different, 
the body is called an asymmetric top. Table 8-1 shows that CH 4 and CCI 4 are 
spherical tops; CH 3 C1 and NH 3 are symmetric tops; and H 2 0 and NO z are asym- 
metric tops. The quantum mechanical problem of the rotation of spherical tops and 
symmetric tops can be readily solved, but the rotation of an asymmetric top is fairly 
involved. 

We shall start with the easiest example, namely, that of spherical top / x = / B = / c . 
The quantum mechanical problem of a spherical top is readily solvable, having energy 
levels £j and degeneracy coj given by 

J(J + 1 )h 2 
Bj ~~ 21 

J = 0,1,2,... (8-14) 

tOj = (2 J + l) 2 

The high-temperature limit of the partition function is 


q [oi = - C(2J + i) 2 e -^+D 5 1 / 2 «r dJ 
a Jo 


(8-15) 
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Here again we have introduced a symmetry number a. The symmetry number for a 
polyatomic molecule is simply the number of ways that the molecule can be rotated 
“ into itself.” For example, a = 2 for H 2 0 and a = 3 for NH 3 . For methane, a = 12 
since there is three-fold symmetry about each of the four carbon-hydrogen bonds. 
Similarly, a = 4 for ethylene and 12 for benzene. Classically, the symmetry number 
avoids overcounting indistinguishable configurations in phase space. For those readers 
who know some group theory, o is the number of pure rotational elements (including 
the identity) in the point group of a nonlinear molecule. Since high temperature means 
that high values of J are important, we may neglect 1 compared to J in Eq. (8-15) 
and write 




(8-16) 


Problem 8-2 involves the derivation of Eq. (8-16) by evaluating the integral in 
Eq. (8-15) without neglecting 1 compared to J. 

The quantum mechanical problem of a symmetric top (I A = I B / I c ) is also solvable 
in closed form. In this case the energy levels depend upon two quantum numbers, one 
of which is a measure of the total rotational angular momentum of the molecule J, and 
the other a measure of the component of the rotational angular momentum along the 
unique axis of the symmetric top K, that is, the axis having the unique moment of 
inertia (customarily denoted by I c ). It might be pointed out here that any molecule 
with an w-fold axis of symmetry, with n > 3, is at least a symmetric top. The expression 
for the energy levels is 


'JK 


ft 2 U(J + 1) 
2 l I A 



where J = 0, 1, 2, . . .; K = J, J — 1, . . . , — J; and the degeneracy is 


Wjk ~~ ( 2*1 + 1 ) 


The partition function is, then. 


Qrot 


- f (2J + £ e -<‘c-«„)K* 

o J = 0 K= -J 


where 


ft 2 


Uj 2I ; kT 


j = A or C 


Problem 8-3 converts this to a double integral over J and K, which results in 

7t 1/2 /871 2 / . kT K /&~ 2 T-lrT\ 1/2 


/$7i I A kT\ /in I c kT\ 
q ' cl ~ o \ h 2 )\ h 2 ) 


(8-17) 


Notice that this reduces to the rotational partition of a spherical top [Eq. (8-16)] 
when I A = I c . 

The next case is that of an asymmetric top I A ^ Is ^ Ic • This is the most commonly 
occurring type of molecule. The quantum mechanical problem of the rotational levels 
of an asymmetric top is a fairly involved problem and must be solved numerically. 
Consequently a quantum-statistical treatment is awkward, and it is desirable to use 
classical mechanics. Even the classical Hamiltonian of a rigid asymmetrical rotor is 
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quite complicated. We confine ourselves, therefore, to the statement that insertion of 
the classical Hamiltonian into the classical phase integral leads, after a rather long but 
straightforward integration, to (see Problem 8-16) 


Qrol 


n 112 (Sn 2 I A kT\ l/2 (in 2 I B kT\ l/2 (Sn 2 I c kT\ 1/2 

h 2 M h 2 M h 2 ) 


( 8 - 18 ) 


Notice that this is a generalization of Eqs. (8-16) and (8-17). 

If we introduce the characteristic rotational temperatures into Eq. (8-18), we have 


71 


1/2 


(7rot 

G 



( 8 - 19 ) 


Table 8-1 contains 0^,0^, and © c for several molecules. The rotational contributions 
to some thermodynamic functions are 


E tot = iNkT 


(8-20) 

Cv, rot = iNk 


(8-21) 

[n 1/2 

S rot = Nk Ini — 

/ T 3 e 3 

(8-22) 

{ © A e B eJ \ 


Note that since there are three degrees of rotational freedom, the rotational kinetic 
energy here is 3NkT/2, in accord with equipartition of energy. 


8-3 THERMODYNAMIC FUNCTIONS 


We can now use the results of Sections 8-1 and 8-2 to construct q(V , T). We get 
for linear polyatomic molecules 


( 8 - 23 ) 


NkT 


(2nMkTy lz Y/ T f 3 ^ 5 \ ^ /kT 

( h 2 ) V ' <t©, ' ( (1 - e-W)\ Wel<P ' 

, [/2nMkT\ 3 ' 2 Ve] , / T \ 

~ ”11 h 2 ) JvJ + n W©J 

•?, [It + ln(1 “ e_e,,j/T) ] + ^ + ln (8-24) 



pV = NkT 


( 8 - 25 ) 

( 8 - 26 ) 


( 8 - 27 ) 

( 8 - 28 ) 
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and for nonlinear polyatomic molecules: 

/ 2 nMkT\ 312 n m / 
q ~\ h 2 ) ' a (©„ 


/ 2nMkT\ 3/2 n il2 / T 3 \ ,/2 l 3 ^ 6 e~ e ^ l2T \ 

h 2 j a Ux©»©J \R «-*+*&* 


£>.ivr 


NkT 


NkT 


, [2nMkT] 1 Ve , ( T* \ ,/z 

~ "L h 2 J N + ” a \©a©b©c) 

3 «-6 /0 . © . It \ D 

= 1 + 1 + + e e„j/T _ j) - -£f 


C„ 3w—6 /© \ 2 

m’ i + 1+ ?, (“f) - ly 




pV = NkT 


2nMkT ] 3/2 Fc 5/2 


7t 1/2 C 3 / 2 


3w — 6 r 
>=i L 


®vjlT 
e e oj/r __ j 


/ 7’'’ \*/' 4 

I©, ©B ©C/ 

ln(l - e _e “ j/T )| 


(8-29) 


(8-30) 


(8-31) 


(8-32) 


+ In tO el (8-33) 


(8-34) 


Table 8-1 contains the characteristic rotational temperatures, the characteristic 
vibrational temperatures, and 

D 0 = D e -Y,\hvj 

j 

for a number of polyatomic molecules. See Herzberg under “Additional Reading” for 
an excellent chapter dealing with practical statistical thermodynamical calculations 
for polyatomic molecules. He includes a number of corrections to the above formulas 
and discusses them numerically. 

Table 8-2 gives the vibrational contribution to the heat capacity for a variety of 
molecules of different shapes. It can be seen that the vibrational contributions are far 


Table 8-2. Vibrational contribution to heat capacities of some polyatomic molecules at 300°K 


total 

Cv/Nk 


molecule 






C'^INk 

(calc.) 

C0 2 , linear 

©XK) 

1890 

3360 

954 





degeneracy 

1 

1 

2 





contribution to C v /Nk 

0.073 

0.000 

0.458 


0.99 

3.49 

N 2 0, linear 

©*>( K) 

1840 

3200 

850 





degeneracy 

1 

1 

2 





contribution to C v /Nk 

0.082 

0.003 

0.533 


1.15 

2.65 

NH 3 , pyramidal 

©XK) 

4800 

1360 

4880 

2330 




degeneracy 

1 

1 

2 

2 




contribution to C v /Nk 

0.000 

0.226 

0.000 

0.026 

0.28 

3.28 

CH 4 , tetrahedron 

©XK) 

4170 

2180 

4320 

1870 




degeneracy 

1 

2 

3 

3 




contribution to C v /Nk 

0.000 

0.037 

0.000 

0.077 

0.30 

3.30 

H 2 0, isosceles triangle 

©XK) 

2290 

5160 

5360 





degeneracy 111 

contribution to C v /Nk 0.028 0.000 0.000 


0.03 


3.03 
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from their equipartition values at the temperatures listed and that the agreement be- 
tween the calculated and experimental values of C v jNk is excellent. A calculation for 
more complicated molecules would show similar agreement between the calculated 
values and the experimental data. 

Table 8-3 compares calculated values of the entropy to those measured calori- 
metrically. It can be seen again that the agreement with experiment is quite good. In 
fact, calculated values of the entropy are often more accurate than measured values, 
provided sophisticated enough spectroscopic models are used. 


Table 8-3. The entropy of several polyatomic gases at 25°C and 1-atm pressure* 



S(calc.) 

(e.u.) 

S(exp.) 

(e.u.) 

co 2 

51.1 

51.0 

nh 3 

46.1 

46.0 

no 2 

57.5 

57.5 

cio 2 

59.4 

59.6 

CH* 

44.5 

44.5 

CH 3 Cl 

55.8 

56.0 

ecu 

74.0 

73.9 

c 6 h 6 

64.5 

64.4 


* The experimental values have been corrected for nonideal gas behavior. 


There is, however, a class of molecules for which the type of agreement in Table 8-3 
is not found. For example, it is found that for carbon monoxide, S calc = 47.3 e.u. and 
S ex p = 46.2, for a discrepancy of 1.1 e.u. Other such discrepancies are found, and in 
all cases Scale > S cxp . This difference is often referred to as residual entropy. The 
explanation of this is the following. Carbon monoxide has a very small dipole moment, 
and so when carbon monoxide is crystallized, the molecules do not have a strong 
tendency to line up in an energetically favorable way. The resultant crystal, then, is 
a random mixture of the two possible orientations CO and OC. As the crystal is 
cooled down toward 0°K, each molecule gets locked into its orientation and cannot 
realize the state of lowest energy with £2 = 1, that is, all the molecules oriented in the 
same direction. Instead, the number of configurations £2 of the crystal is 2 N , since each 
of the N molecules exists equally likely (almost equally likely since the dipole moment 
is so small) in two states. Thus the entropy of the crystal at 0°K is S = k In £2 = Nk 
In 2 instead of zero. If Nk In 2 = 1.4, entropy units are added to the experimental 
entropy, the agreement in the case of carbon monoxide becomes satisfactory. If it 
were possible to obtain carbon monoxide in its true equilibrium state at T = 0, this 
discrepancy would not occur. A similar situation occurs with nitrous oxide. For 
H 3 CD, the residual entropy is 2.8 esu, and this is explained by realizing that each 
molecule of monodeuterated methane can assume four different orientations in the 
low-temperature crystal, and so S residual = Nk In 4 = 2.7 esu, in very close agreement 
with the experimental value. 


8-4 HINDERED ROTATION 

There is one extension or modification of the partition function of polyatomic 
molecules that we shall discuss in this section. In molecules such as ethane, one of the 
most important internal degrees of freedom is a rotation about the single carbon- 
carbon bond. Because of the interactions between the hydrogen atoms on each carbon, 
this rotation is not free, but is said to be restricted or hindered. As the two methyl 
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Figure 8-1. Potential energy of internal rotation in ethane. This curve can be represented approxi- 
mately by £F 0 (1 — cos 3<£). 

groups rotate about the carbon-carbon bond, the hydrogen atoms become alternately 
eclipsed (directly opposite each other) and staggered. The potential energy associated 
with this rotation is shown in Fig. 8-1. The maxima correspond to the configuration 
where the hydrogen atoms are eclipsed, and the minima correspond to the configura- 
tion in which they are staggered. At temperatures such that kT f> V 0 , the internal 
rotation is essentially free and can be treated by methods similar to the rigid rotor. 
At temperatures such that kT V 0 , the molecule is trapped at the bottom of the wells 
in Fig. 8-1, and the motion is that of a simple torsional vibration, which can be treated 
by a method similar to that used for the simple harmonic oscillator. Typical values of 
V 0 are such, however, that at ordinary temperatures, the motion is intermediate 
between that of free rotation and torsional vibration. 

It is necessary to solve the Schrodinger equation for the potential shown in Fig. 8-1 . 
This potential can be approximately represented by iV 0 (\ — cos 3 <j)), for which the 
Schrodinger equation is 

h 2 d 2 \ 1/ 

-8 + -<*»*«*—* 

where I r is an effective moment of inertia whose precise form we ~ shall not need. This 
differential equation is difficult to solve analytically, but the eigenvalues have been 
tabulated numerically as a function of V 0 . These can be used to compute a partition 
function for the restricted rotation, which can then be used to compute thermo- 
dynamic properties. There are extensive tables of the various thermodynamic functions 
as a function of V 0 jkT. 

Figure 8-2 shows a sketch of the contribution of internal rotation to the heat 
capacity in an ethanelike molecule. One can use curves such as these to fit heat capacity 



Figure 8-2. The contribution of internal rotation to the heat capacity as a function of barrier height V 0 . 
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Table 8-4. Potential barriers of some molecules 


molecule 

F 0 (kcal/mole) 

CH 3 CH 3 

2.7-3.0 

CH 3 CCI 3 

2.7 

CH 3 CH 2 C1 

2.1-A.l 

ch 3 oh 

1 . 1 - 1.6 

CH 3 SH 

1. 3-1.5 

ch 3 nh 2 

1.9 

CH 3 CH 2 gH 3 

3.3 

ch 3 chcch 2 

1.59-1.65 


data and thus obtain a thermodynamic estimate of V 0 . One can also generate curves 
for the entropy as a function of V 0 and hence determine V 0 by comparing the results to 
experimental values of the entropy. The values of V 0 determined from heat capacity 
data and entropy data are in fair agreement. There are a number of other ways of 
determining V 0 (see e.g. Section 9-2(F)) and the reader is referred to a review article 
by Wilson* for a general discussion of the problem of barriers to internal rotation in 
molecules and a comparison of the various methods for determining V 0 experi- 
mentally. Table 8-4 lists potential barriers for several molecules. The range of values 
given for each molecule is an indication of the agreement between different experi- 
mental methods of determining V 0 . 

ADDITIONAL READING 
General 

Davidson, N. 1962. Statistical mechanics . New York: McGraw-Hill. Chapter 11 . 

Fowler, R. H., and Guggenheim, E. A. 1956. Statistical thermodynamics. Cambridge: Cambridge 
University Press. Sections 326 through 331. 

Hill, T. L. 1952. Statistical thermodynamics . Cambridge: Cambridge University Press. Chapter 9. 
Kubo, R. 1965. Statistical mechanics. Amsterdam: North-Holland Publishing Co. Sections 3-1 to 
3-3. Example 3-2. 

Mayer, J. E., and Mayer, M. G. 1940. Statistical mechanics. New York: Wiley. Chapter 8 . 
Munster, A. 1969. Statistical thermodynamics , Vol. I. Berlin: Springer-Verlag. Sections 6-8 through 
6 - 11 . 

Rushbrooke, G. S. 1949. Statistical mechanics. London: Oxford University Press. Chapter 9. 
Spectroscopy 

Barrow, G. M. 1962. Introduction to molecular spectroscopy . New York: McGraw-Hill. 

Costain, C. C. 1970. In Physical chemistry : an advanced treatise , Vol. IV, ed. by H. Eyring, D. 
Henderson, and W. Jost. New York: Academic. 

Hall, J. R. 1970. In Physical chemistry , an advanced treatise, Vol. IV, ed. by H. Eyring, D. Henderson, 
and W. Jost. New York: Academic. 

Herzberg, G. 1945. Infrared and Raman spectra of polyatomic molecules. New York: Van Nostrand. 
King, G. W. 1970. In Physical chemistry, an advanced treatise, Vol. IV, ed. by H. Eyring, D. Henderson, 
and W. Jost. New York: Academic. 

. 1964. Spectroscopy and molecular structure. New York: Holt, Rinehart & Winston. 

PROBLEMS 

8-1. The HOH bond angle in water is 104°, and the OH bond length is 0.96A. Calculate 
the center of mass and the three moments of inertia of water. From this verify the results 
for © rot for H 2 0 given in Table 8-1. 

8-2. Evaluate Eq. (8-16) without neglecting 1 compared to / in Eq. (8-15). 

8-3. Derive Eq. (8-17) from its corresponding summation by converting the sum to an 
integral. 


* E. B. Wilson, Jr., Adv. Client. Phys., 2, p. 367, 1959. 
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8-4. Verify the calculated entries in Table 8-3 for CH 4 , using the data in Table 8-1. 

8-5. Use the data in Table 8-1 to calculate the entropy of C0 2 at 25" C and 1 atm. Compare 
your result to that in Table 8-3. 

8-6. The same as Problem 8-5, but for H 2 0. 

8-7. The same as Problem 8-5, but for CH 4 . 

8-8. What molar heat capacities would you expect under classical conditions for the fol- 
lowing gases: (a) Ne, (b) 0 2 , (c) H 2 0, (d) C0 2 , and (e) CHCI 3 ? 

8-9. Verify the results for methane in Table 8-3. 

8-10. Verify the results for ammonia and water in Table 8-2. 

8-11. Calculate the entropy of CI0 2 at 298°K and compare this to the experimental value 
of 61 e.u. 

8-12. Verify from group theoretic character tables that the symmetry number is equal to 
the number of pure rotational elements (including the identity) in the point group of the 
molecule. 

8-13. Show that C v for NH 3 at 300 K is 3.3 Nk. 

8-14. In Problem 1-36, heat-capacity data were listed for a calculation of the third-law 
entropy of nitromethane. From the following molecular data, calculate the statistical entropy 
S289 • Bond distances (A): N-O 1.21; C-N 1.46; C-H 1.09. Bond angles: O-N-O 127°; 
H-C-N 109$°. From these distances, calculate the principal moments of inertia, I — 67.2, 
76.0, 137.9 x 10" 40 g-cm 2 . The fundamental vibration frequencies in cm -1 are 476, 599, 647, 
921, 1097, 1153, 1384, 1413, 1449, 1488, 1582, 2905, 3048(2). One of the torsional vibrations 
has become a free rotation around the C-N bond with I = 4.86 x 10“ 40 . 

8-15. The classical rotational kinetic energy of a symmetric top molecule is 

K _p> 2 ( p* - p* c ° s A 2 p » 2 

~ 21 A + 21 A sin 2 0 + 21c 

where l a, l a, and l c are the principal moments of inertia, and 0, <f>, and >p are the three Euler 
angles. Derive the classical limit of the rotational partition function for a symmetric top 
molecule. Hint: Recall that the Euler angles have the ranges: 

O^0<7T 
0 < <f> < 27T 
0 < *p < 2tt 

8-16. The classical Hamiltonian for an asymmetric top molecule with principal moments 
of inertia I A , I B , and I c is given by 

H — — — {( p <t> —p* cos 0) cos ib — pe sin 0 sin &Y 
21 A sin u 

+ or *- 2 a Kp* - p* cos 0) sin \}> + p e sin 0 cos <p) 2 + p* 2 

21b Sin v 21c 

Derive the classical limit of the rotational partition function for an asymmetric top molecule. 
Hint: It may help to rearrange the Hamiltonian and integrate over pe,p*, p* in that order. 
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CHEMICAL EQUILIBRIUM 


One of the most important chemical applications of statistical thermodynamics is the 
calculation of equilibrium constants in terms of molecular parameters. Often the results 
of such calculations are more accurate than the experimental values. In Section 9-1 
we derive the basic equations, giving the equilibrium constant in terms of partition 
functions. This is an easy derivation, since Eq. (1-68) gives the thermodynamic equi- 
librium condition in terms of chemical potentials, and in Chapters 5, 6, and 8 we 
derived thermodynamic functions such as the chemical potential in terms of partition 
functions. Thus there are no new principles introduced in this chapter. In a sense, it is 
simply a numerical application of the results of the previous chapters. The bulk of this 
chapter is a numerical discussion of the few equations of Section 9-1. In Section 9-2 
we discuss six types of reactions: (1) the association of atoms or molecules in a vapor, 
(2) a simple isotopic exchange reaction, (3) a more complicated isotopic exchange reac- 
tion, (4) a chemical reaction involving only diatomic molecules, (5) a chemical reaction 
involving polyatomic molecules, and (6) a chemical reaction involving a molecule with 
restricted internal rotation. Then in Section 9-3 we discuss the use of thermodynamic 
tables to calculate equilibrium constants. We shall see that thermodynamic tables are 
equivalent to a tabulation of the partition function at various temperatures. 


9-1 THE EQUILIBRIUM CONSTANT 

IN TERMS OF PARTITION FUNCTIONS 

We shall consider the general homogeneous gas phase chemical reaction 

v a A + v b B^± v c C+ v d D (9-1) 

at equilibrium in a closed thermostated vessel. The v’s are stoichiometric coefficients 
and A, B, and so on, represent the reactants and products. The thermodynamic condi- 
tion for chemical equilibrium is derived as follows. 
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We first write Eq. (9-1) algebraically as 

v c C 4- v d D — v a A — v b B — 0 (9-2) 

We then define a variable k such that dNj = Vj dk, where j = A, B, C, or D and where 
Vj is taken to be positive for products and negative for reactants. The Helmholtz free 
energy of the system is 

dA = -SdT -pdV + Y J VjdN j 

j 

For a reaction vessel at fixed volume and temperature, 

dA=Yj Vj dNj = ^ dk (constant T and V) (9-3) 

For a system at equilibrium, the free energy must be a minimum with respect to all 
possible changes dk, and so (dA/dk) T v = 0. From Eq. (9-3) then, we have the condi- 
tion for chemical equilibrium: 

X Vj Hj = VcHc +v D n D - v A fi A -v B fi B = 0 (9-4) 

j 

This is the general thermodynamic equation of chemical equilibrium. We shall now 
introduce statistical thermodynamics through the relation between chemical potential 
and partition functions. In a mixture of ideal gases, the species are independent and 
distinguishable, and so the partition function of the mixture is a product of the parti- 
tion functions of the individual components. Thus 


Q(N a ,N B9 N c ,N d , V, T ) = Q(N A9 K T)Q(N b , V, T)Q(N C9 V 9 T)Q(N D9 V, T ) 

qAY, Tf- q^v, T) Nb qdiV, T)" c q D (V, T)"° 


N A l N b \ N c l N D l 
The chemical potential of each species is given by an equation such as 


(9-5) 


Va = ~kT 


f d In Q \ 

\ &N a / nj.v.t 


-kT\n 


T) 

n a 


(9-6) 


where Stirling’s approximation has been used for N A l. The Nj subscript on the partial 
derivative indicates that the numbers of particles of the other species are held fixed. 
Equation (9-6) simply says that the chemical potential of one species of an ideal gas 
mixture is calculated as if the other species were not present. This, of course, is obvious 
for an ideal gas mixture. 

If we substitute Eq. (9-6) into Eq. (9-4), we get 


N c vc N d v ° = g c Vc g D VD 

NS-NtT q A ^q B ' B 

For an ideal gas, the molecular partition function is of the form f(T)V (see Problem 
4-15), so that qjV is a function of temperature only. This allows us to write 


Pc c Pd° (qclVr c (q D IVy° 
pa va pb b {ajjvy^qjvy 


(9-8) 


where K C (T) is the equilibrium constant of the reaction. For an ideal system, K c is a 
function of temperature only. 
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Another commonly used equilibrium constant is K p (T ), which is expressed in terms 
of partial pressures rather than concentrations. We can derive the equation for K p (T) 
by substituting p s = PjkT into Eq. (9-8): 

K p (T) = ~P^- B = {kTy c+ ' ,D ~ VA ~' ,B K c {T) (9-9) 

Pa Pb 

By means of Eq. (9-8) or (9-9), along with the results of Chapters 5 through 8, it 
is a simple matter to calculate equilibrium constants in terms of molecular parameters. 
This is best illustrated by means of examples. 


9-2 EXAMPLES OF THE CALCULATION 
OF EQUILIBRIUM CONSTANTS 


A. THE ASSOCIATION OF ALKALI METAL VAPORS 

We use, as an example, the reaction 

2Na^±Na 2 

The equilibrium constant for this reaction can be written as 


K p (T)= f dimef (9-10) 

P monomer 

- (tT, "‘(W^ <9 -" ) 

The equation for the partition function of a monatomic ideal gas [Eq. (5-15)] and a 
diatomic molecule [Eq. (6-51)] are 


#Na(^ K) “ 

^27rm Na fcTj 

3/2 

Vq tU c(T) 

(9-12) 


/'27tm Naj kT' 

| 3/2 f 8 n 2 JkT e-™ 2 


\ h 2 , 

) V 2 h 2 (1 — e~ phv ) <t>1 * e 


= 

^27T771 Na2 kT' 

\ h 2 . 

| 3/2 p^2^") (1 — e“®*' /T )- 1 e Do/ * T 

(9-13) 


Note that we have introduced D 0 = D e — into Eq. (9-13). We shall usually do 
this since there are extensive tables of D 0 . Tables 6-1 and 8-1 contain values of D 0 
for diatomic and polyatomic molecules. In addition to values of D 0 , Table 6-1 con- 
tains the other parameters needed in Eq. (9-13). From Table 6-1, ©„ = 229 0 K, 
0 r = 0.221 °K, and D 0 = 17.3 Kcal/mole. In addition, we need to know that the ground 
electronic state of a sodium atom is 2 S 1/2 , and that the next electronic state lies 
approximately 16,000 cm -1 above the ground state. At 1000°K, then 

q^a (2n x 23 x 1.66 x 10" 24 x 1.38 x 10" 16 x 10 3 \ 3/2 
~V ~ \ 6.626 x 6.626 x 10“ 54 / 9elec 

= (6.54 x 10 26 ) x 2 = 1.31 x 10 27 (9-14) 

= (1.85 x 10 27 ) x (2.26 x 10 3 )(4.88)(5.96 x 10 3 ) 

= (1.22 x 10 35 ) 


(9-15) 
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and the equilibrium constant is 


K P {T) = 


1.22 x 10 35 

(1.38 x 10 -16 )(1000)(1.72 x 10 54 ) 


= 0.50 x 10 6 (dyne/cm 2 ) 1 =0.50 atm 1 


(9-1 6) 


The experimental value is 0.475 atm“ We have used the fact that 1 atm = 
1.01 x 10 6 dynes/cm 2 . This can easily be calculated from the gas constant or the 
Boltzmann constant in ergs/deg-mole and liter-atm/deg-mole. Note that K p has units 
of 1 /(pressure), even though it is a function only of the temperature for an ideal system. 
Table 9-1 gives K p (T ) at a number of other temperatures. The agreement is seen to be 
good. 


Table 9-1 . A comparison of the experimental values of the equilibrium constant for the reaction 
2Na ^Na 2 with the values calculated from Eq. (9-11) 


T(°K) 

tf p (calc.)(atm) -1 

X„(exp.)*(atm) _1 

900 

1.44 

1.32 

1000 

0.50 

0.47 

1100 

0.22 

0.21 

1200 

0.1 1 

0.10 


* See C. T. Ewing, et al. t J. Chent. Phys ., 71, 473, 1967. 


A general principle of dissociation or association reactions of this type is that the 
dimer is favored by the energetics of the reaction but the monomers are favored by 
the entropy. Problems 9-2 and 9-3 involve similar calculations for the association of 
potassium vapors and the dissociation of I 2 . 


B. AN ISOTOPIC EXCHANGE REACTION 

We consider the isotopic exchange reaction 


H 2 + D 2 ^±2HD 


We can use this simple reaction to illustrate the consequences of the Born-Oppenheimer 
approximation in isotopic exchange reactions. The Born-Oppenheimer approximation 
is based upon the approximation that the nuclei are so much more massive than the 
electrons that it is legitimate to calculate the electronic state of a molecule in a field of 
fixed nuclei. Thus H 2 , D 2 , and HD all have the same internuclear potential function, 
and therefore have the same force constant k 9 the same depth of the potential D e , 
and the same internuclear separation. This leads to a great deal of canceling between 
the numerator and denominator in the ratio of partition functions in the equilibrium 
constant. 

The equilibrium constant is 


K(T) = KJT) = K C (T) = 


Phd" 


Phd 


Qhd 


Ph 2 Pd 2 Ph 2 Pv 2 4h 2 4d 2 
( 2nm HD kT \ 3 ( T \ 2 1 e~ e ^ 2T \ 2 

\ h 2 ) \® r ,,J \1 - e- e -™ /T ) 


( 2nm Hl kT \ 312 pnm^kT Y* 12 ( T 2 \ 1 e~ & ^ T \ / e -«W2T \ ^ 

\ h 2 ) \ h 2 ) \40 ri H 2 © r .D 2 /\l - - e- e ^ IT ) 

wihd 3 4© r , H2 0 r , Dl (1 - g ~WT )(1 _ e -WT) 


(m H 2 m Dj ) 3/2 © r 2 ,HD 


(1 


_ e -®u,HD/rj2 


^ — (2 ©„,hd - ©w,h 2 — ®v,d 2 )I 2 T 


(9-17) 
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From Table 6-1, 0, Hi = 85.3°K, 0 rt D2 = 42.7°K, 0, ?H2 = 6215°K, and 0„,d 2 = 
4394°K. The values for HD are easy to calculate from these, since S v = hvjk , and 
v = (kln) llz l2n 9 where fi is the reduced mass of the molecule. Since the force constants 
of all three molecules are equal under the Born-Oppenheimer approximation, we have 


v hd _ /^h 2 \ 
v H2 VW 


1/2 


1/2 


or 


®v,HD _ /VH 2 \ 

®v, h 2 V*hd/ 

which gives 

®r, HD = (J) 1 , 2 ©i; # H 2 

Of course, we also have the relations 

©„.hd = (D 1 / 2 ©,.d 2 and ©„,D 2 = (i) 1/2 ©„, Hj 

Similarly, since © r — h 2 jZn z Ik and / = \ir z , where r e is the internuclear separation, 
we can write 


(9-18) 


(9-19) 


(9-20) 


®r, HD _ A*H 2 _ 3 
®r, H 2 A*HD 4 


(9-21) 


or 


®r. HD _ 3 
®r,D 2 “2 


(9-22) 


Note that 0 r H2 = 20 r ^ Di ■ We shall calculate K{T) between 195°K and 741°K. At 
these temperatures, the (1 — e~ evlT ) factors in Eq. (9-17) are approximately unity. 
Substituting all of these reduction formulas into Eq. (9-17), we get 

K(T) = 4(|) 3 / 2 (|.)g-c 3,/2 - 1 -^/ 2 ) 1/2 i e ^H 2 /2T 


= 4(1.06)exp 



(9-23) 


Table 9-2 compares Eq. (9-23) with the experimental data of Urey and Rittenberg, 
obtained during Urey’s early investigations on heavy hydrogen. Equation (9-23) 
shows that the product or the symmetry numbers of the H 2 and D 2 predominates in 
the calculation of the equilibrium constant. All of the other factors are close to unity. 

The hydrogen-deuterium exchange reaction, in fact, shows an unusually large 
deviation from K — 4, since the percentage mass difference between hydrogen and 


Table 9-2. A comparison of the experimental values of the equilibrium constant of the reaction 
H 2 +D 2 ^ 2HD with the values calculated from Eq. (9-23) 


7X°K) 

tf(calc.) 

*(exp.) 

195 

2.84* 

2.92 

273 

3.18 

3.24 

298 

3.26 

3.28 

383 

3.46 

3.50 

543 

3.67 

3.85 

670 

3.77 

3.8 

741 

3.81 

3.75 


* Quantum effects begin to be important at these low temperatures. If these effects are included, a value 
of 2.87 is obtained. 

Source : D. Rittenberg, W. Bleakney, and H. C. Urey, J. Client. Phys ., 2, p. 362, 1934, and A. J. Gould, W 
Bleakney, and H. S. Taylor, J. Client. Phys., 2, p. 362, 1934. 
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deuterium is large. For other diatomic isotopic exchange reactions such as 


N£ 4 + NJ 5 ^ 2N 14 N 15 , the equilibrium constant differs very little from the value 4. 


It is easy to derive a general expression for the equilibrium constant of such reactions 
by expanding the complete expression for K(T) in a power series in A/M, where A 
is the difference in atomic mass between the two isotopes, and M is the atomic mass of 
the heavier isotope. To order (A/M) 2 , the result is (see Problem 9-6) 


K(T) = 4 




A2© MtVlb /32Af2T 


(9—24) 


where © Mvib is the characteristic vibrational temperature of the diatomic molecule 
containing two heavier nuclei. For the N] 4 + N] 5 ^ 2N 14 N 15 reaction, A/M = 1/15 
and 0 M vib = 0.97 times the value in Table 6-1, or © M# vib = 3260°K. Equation (9-24) 
becomes 

K(T) = 4(1.0005)e _o,44/T (9-25) 

which is essentially equal to 4 for all temperatures greater than 100°K, say. Thus the 
equilibrium constant is completely determined by the symmetry of the molecules 
involved in the reaction. This is not necessarily so for more complicated molecules, 
however, as the next section shows. 


C. A MORE COMPLICATED ISOTOPIC EXCHANGE REACTION: 

AN ILLUSTRATION OF THE TELLER-REDLICH PRODUCT RULE 

Consider the exchange reaction 
CH 4 + DBr ^CH 3 D + HBr 

Because of the Born-Oppenheimer approximation, the internuclear potential surfaces 
of CH 4 and CH 3 D are the same, and the internuclear potential energy curves of HBr 
and DBr are the same. As before, this leads to a great deal of simplification in the ratio 
of partition functions. The equilibrium constant for this reaction is 

p(CH 3 D)p(HBr) g(CH 3 D)g(HBr) 

1 J p(CH 4 )p(Br) s(CH 4 MDBr) 


°CH 4 °PBr / ^CH 3 P^HBr \ 3/2 ^HBr (J A^ B^c)cH3^> flvib,CH 3 p(Zvib t HBr 
°CH 3 D°HBr \ ^CH 4 ^DBr / ^DBr Ua^B^c)cH 4 4vib.CH 4 4vib,DBr 


Mch 4 Mdb. 


(9-26) 


The fundamental vibrational frequencies of CH 4 , CH 3 D, HBr, and DBr are given 
in Table 9-3. All of these frequencies are high enough that the factors (1 - e~ eJT ) in 

Table 9-3. Fundamental vibrational frequencies of CH 4 , CH 3 D, HBr, and DBr 


frequency 


2917 cm" 1 
1534 
3019 
1306 


degeneracy frequency degeneracy 


1 2200 cm" 1 1 

2 2945 1 

3 1310 1 

3 1471 2 

3021 2 

1155 2 


frequency 


degeneracy frequency degeneracy 


2650 cm 


1880 cm 
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the vibrational partition functions are essentially unity at room temperature. The ratio 
of the vibrational partition functions in Eq. (9-26) includes then only the zero-point 
vibrational energy terms and is 

gvib,CH 3 pgvib,HBr _ CXpf ]T 
(Zvib»CH 4 <?vib.DBr l j 


2 T 


= exp 


220 cm 1 
k kT 


(9-27) 


using the frequencies given in Table 9-3. 

The ratio of the symmetry numbers in Eq. (9-26) is 12 x 1/(3 x 1) = 4. To calculate 
the remainder of Eq. (9-26), we could look up all the moments of inertia we need in 
the literature, but this is not necessary. We can avoid it by using the Teller-Redlich 
product rule for isotopically substituted compounds, which says that if molecules A 
and A ' differ only by isotopic substitution, then 


r M '\ 3 ' 2 V 

jr) t 


n /m'\ 3/2 3 " -5 v-' 

n R-) n f 

i=i Xmj j=i Vj 


for linear molecules and 


(linear) 


(9-28) 


/m 3 ' 2 (I A I B 'I c ') m A (m\\ 312 3n fT 6 
\m) (I A I B I c ) l/2 k\\mj M 


(nonlinear) 


(9-29) 


for nonlinear molecules. In these two equations, M denotes the total mass of molecule 
A; rrij is the mass of the yth atom; and Vj is the j fundamental vibrational frequency. 
The proof of the Teller-Redlich product involves a detailed consideration of the 
normal coordinates of molecules, and so we shall not prove it here. It is easy to show, 
however, that it is valid for a diatomic molecule under the Born-Oppenheimer approxi- 
mation (see Problem 9-7). 

The Teller-Redlich product rule is useful to us, since we already have the Vj in 
Table 9-3. Furthermore, when this is used in Eq. (9-26), the products of the atomic 
masses cancel, and we are left with the simple result: 

K(T) = 4e 3il/T Y\ (^5) 

V DBr j=l \ v j CH 4 / 

= 2.99c 3 17/T 
= 8.65 at 298°K 


(9-30) 

(9-31) 


D. A CHEMICAL REACTION INVOLVING DIATOMIC MOLECULES 

We shall calculate the equilibrium constant for the reaction 


H 2 + I 2 — 2HI 

from 500°K to 1000°K. The equilibrium constant is given by 


K(T) = 




9 HI 


(QhJ ^0 (<Zi 2 / ^0 9h 2 Q 12 

wi w 2 \ 3/2 /4© r H2 © r ,2 \ (1 - e-o^l - c _0 “ ,2/r ) 


= / \ 
\m Hl mj 


" 2 © 12 \ 

ll©7¥") 


(1 _ 2 

(2D0 111 — D 0 H2 — D 0 l2 ) 


x exp 


RT 


(9-32) 


All of the necessary parameters are given in Table 6-1. Figure 9-1 shows In K plotted 
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1000 IT 

Figure 9-1. The logarithm of the equilibrium constant versns \/T for the reaction H 2 + I 2 ^2HL The 
points are calculated from Eq. (9-32) and the crosses are the experimental values. (From 
A. H. Taylor and R. H. Crist, J, Am. Chem , Soc ., 63, 1377, 1941.) 

versus 1/T. The thermodynamic equation for the variation of the equilibrium constant 
with temperature is 

rf(lnK)=-^</(i) (9_33) 

where AH is the heat of reaction. From Fig. 9-1 we get AH = —3100 cal/mole, com- 
pared to the experimental value of —2950 cal/mole. 

Problems 9-8 and 9-9 involve the calculation of the equilibrium constant for some 
other reactions involving only diatomic molecules. 


E. A REACTION INVOLVING POLYATOMIC MOLECULES 

As an example of a reaction involving a polyatomic molecule, consider the reaction 
H 2 + i0 2 ^±H 2 0 
whose equilibrium constant is 


K , t n _ (^h 2 o/K) 

A ’ 0 kT) 1/2 (g H JV)(q 0l IV ) 1/2 


(9-34) 


It is almost as convenient to calculate each partition function separately as to sub- 
stitute them into K p first. Furthermore, we shall use the values in the next section. 

The necessary parameters are given in Tables 6-1 and 8-1. At 1500°K, the three 
partition functions are 




= 2.80 x io 26 e D ° H2/RT 


(9-35) 


^ - (^’"^5)0 - WVMH- 

= 2.79 x lo 30 ^ 0 ’/^ 


(9-36) 
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and 

g Hl Q _ ( 2nm Hl0 kT \ 3/2 / T 3 

v \ h 2 ) a [ e ”* 0 ®** 0 ®”* 0 

= 5.33 x 10 29 e D ° H2 ° /KT (9-37) 

The factor of 3 occurs in q 0 J V because the ground state of O z is . (See Table 6-1 .) 

At 1500°K, the equilibrium constant is 

K p = 4.77 x 10 5 (atm) _,/2 (9-38) 

Table 9-4 compares the calculated values of log K p with the experimental data of 


Table 9-4. The logarithm of the equilibrium constant for the reaction H 2 + i0 2 ^H 2 0 


7TK) 

log Alcaic.) 

log AXexp.)* 

1000 

10.2 

10.1 


5.7 

5.7 

2000 

3.7 

3.5 


* See H. Zeiss, Z. Electrockem., 43, p. 706, 1937. 


Zeiss. The agreement can be considerably improved by using more sophisticated 
spectroscopic models. At high temperatures, the rotational energies of the molecules 
are high enough to warrant centrifugal distortion effects and other extensions of the 
simple rigid rotor-harmonic oscillator approximation. Problems 9-10 through 9-12 
involve similar calculations. 

F. A CHEMICAL REACTION INVOLVING A MOLECULE WITH RESTRICTED 
INTERNAL ROTATION, ETHYLENE-ETHANE EQUILIBRIUM 

As a final example, we discuss the equilibrium constant for the reaction 

c 2 h 4 + h 2 ^±c 2 h 6 

This calculation is more complicated than the previous ones because of the hindered 
rotation in ethane. The equilibrium constant is calculated for various values of the 
barrier height V 0 , and V 0 is chosen to give the best agreement with experiment. 
Table 9-5 gives some values for the equilibrium constant K p for this reaction at various 


Table 9-5. The equilibrium constant for the reaction C 2 H 4 +H 2 ^C 2 H 6 calculated with 
Vo — 3100 kcal/mole 



A: p (atm“ 

calc. 

‘) 

obs. 

673 

1.1 x 10* 

1.2 x 10 4 

773 

4.3 x 10 2 

4.2 x 10 2 

873 

33.5 

32.0 

973 

4.5 

5.0 


Source : E. A. Guggenheim, Trans. Faraday Soc ., 37, p. 97, 1941. 


temperatures as calculated by Guggenheim.* The value of V 0 that seemed to give the 
best agreement was 3.1 kcal/mole. This value agrees fairly well with other determina- 
tions of V 0 for ethane. This is another method for determining barrier heights and 
some values determined by this method are included in Table 8-4. 


1/2 3 

n (1 - e 
1=1 


Kj*l°ITy l^Do 1 


* E. A. Guggenheim, Trans. Faraday Soc., 37, 97, 1941. 







THERMODYNAMIC TABLES 


151 


9-3 THERMODYNAMIC TABLES 

In the previous section we have seen that the rigid rotor-harmonic oscillator approxi- 
mation can be used to calculate equilibrium constants in reasonably good agreement 
with experiment, and because of the simplicity of the model, the calculations involved 
are not extensive. If greater accuracy is desired, however, one must include corrections 
to the rigid rotor-harmonic oscillator model, and the calculations become increasingly 
more laborious. It is natural, then, that a number of numerical tables of partition 
functions has evolved, and in this section we shall discuss the use of these tables. 
These tables are actually much more extensive than a compilation of partition func- 
tions. They include many experimentally determined values of thermodynamic 
properties, often complemented by theoretical calculations. The thermodynamic 
tables we are about to discuss in this section, then, represent a collection of the thermo- 
dynamic and/or statistical thermodynamic properties of many substances. In order 
to fully appreciate the use of these tables, it is necessary to express the equilibrium 
constant in terms of the standard chemical potential ji 0 (T) and then to discuss at 
length the problem, or rather the convention, of the choice of the zero of energy. 

First we shall derive the relation between the equilibrium constant and the standard 
chemical potential [cf. Eq. (1-68)]. Equation (9-4) gives the thermodynamic condition 
for chemical equilibrium: 

VcHc + v d Hd - v a Ha - v„Hb = 0 

For an ideal gas, the chemical potential is of the form [cf. Eq. (1-66) or Eq. (5-21)] 
H(T, p) = p 0 (T) + kT In p (9-39) 

If we substitute this into Eq. (9-4), we get 

lnK p=-^f O- 40 ) 

where 

Afl 0 = V C l*OC + VdVoD “ v Af i OA ~ Vb^OB (9-41) 

The fi 0 are related to the molecular partition function in the following way. For an 
ideal gas, Q = q N /Nl. The chemical potential is given by —kT(d In Q/dN) VtT , so 

-kT\n[j^ (9-42) 

For an ideal gas, the molecular partition function is a function of temperature times 
the volume, and so qjV is a function of temperature only. We can rewrite Eq. (9-42) as 

- kT] "#) $ 

and use the ideal gas equation of state for V/N to get 

p = -kT ln|^|) + kT\n p (9-43) 

This equation is of the form of Eq. (9-39) with p 0 {T) given by 
MT)=-A:Tln|^A:r} 


(9-44) 
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The argument of the logarithm here has units of pressure, but of course the units 
cancel when this term is combined with kT In p. Nevertheless, the numerical value of 
^o(T) does depend upon the units of pressure. In anticipation of the convention used 
in the thermodynamic tables, we shall express the pressure in atmospheres. If we use 
the fact that 1 atm = 1.01 x 10 6 dynes/cm 2 , we can write Eq. (9-43) in the form 

n = -kT ln{^) L01 *J 10 <i } + kT In p( atm) (9-45) 

where ( q\V ) and kT in the ^ 0 (T) term are expressed in cgs units. The equilibrium con- 
stant calculated with these values of (T), however, will be in atmospheres, since we 
have explicitly included the conversion factor of 1.01 x 10 6 . We shall see below that 
the thermodynamic tables are given in pressure units of atmospheres. 

In principle, then, we could calculate ji 0 (T) over a range of temperature and 
tabulate the results for future use in Eq. (9-40). This is almost the content of thermo- 
dynamic tables, but there is one last convention we must discuss, namely, the con- 
vention of the zero of energy. Throughout this book we have calculated partition 
functions in terms of energies relative to a zero of energy being taken as the infinitely 
separated atoms in their lowest energy states. From a molecular point of view, this is 
a satisfactory choice and is conceptually the most convenient choice. It is experi- 
mentally difficult to determine the energy required to separate a molecule into its 
constituent atoms, and so the thermodynamic tables utilize another convention which 
we describe below. Of course, it makes no difference what we choose as the zero of 
energy as long as we are consistent and use the same convention on both sides of a 
chemical equation. Let us go back now and reexamine at what point this arbitrariness 
in the choice of zero energy enters the calculation of partition functions. 

We have seen that to a good approximation, the molecular partition function can 
be written as 


q(V, T ) = q tnBS (V, T)q [ot (T)q vib (T)q tlec (T) (9-46) 

where, of course, q TOt and q vih do not appear in the case of an atom. If we recall the 
discussion centering around Fig. 6-2, it is in the calculation of q tltc (T) that the zero- 
of-energy convention was used. The electronic partition function is 

q t itc (T) - (D el e~ e '' /kT + co e2 e~ t9zlkT + • - • (9-47) 

If one chooses the separated atoms in their ground states as the zero of energy, then 
(cf. Fig. 6-2 or Fig. 9-2) 


«ei = -D € 

&e2 = ^el “b ( Pel ® el ) = ® el “b ^I>12 

— — D e + Asj 2 


(9-48) 


and 


^eicc (T) = co el e D ‘ lkT + co e2 e D - lkT e-^ lkT + • • • 
= e DajkT (co el + co e2 e -** t2lkT + • • •) 

= e D - /kT q° tUc (T) 


(9-49) 
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The superscript on the q° t u c (T) in the last line indicates that this is the electronic 
partition function of a molecule with its ground electronic state taken to be zero. 

If we substitute Eq. (9-49) into Eq. (9-46), we get 

q(K T) = q tnns (V 9 T)g rot (T)g vib (T)g 0 eIec (T)^- /,cT (9-50) 

The vibrational partition function is calculated using energies whose zero is taken 
as the bottom of the potential well, so that there is an energy of iAv(£j ihvj in a 
polyatomic molecule) in the ground vibrational state. It is this zero-point vibrational 
energy that leads to the numerator in the vibrational partition functions of Chapters 
6 and 8, that is, to the e~ &vjl2T in 

e -e VJ f2T 

*vibCO = 11 _ e -e VJ /T ^ (9-51 ) 

If we substitute this into Eq. (9-50), we have 

q(V, T) = q, rans (F, T)q ro ,(T){n (1 - e" 6 ^)- '}? V(?> (D '- (1 *> W*r) (9 _ 52) 

= 4,ra„s (V, T)q r JT)q 0 vib (T)q 0 ' Uc (T)e Do/kT (9-53) 

where we have written D 0 for D e — \ h v j ( c /- Fig. 9-2) and have superscripted 
q yih with a 0 since the term in brackets in Eq. (9-52) is a vibrational partition function 
whose lowest vibrational state is taken to be zero. In addition, we could have super- 
scripted q trans and q rot since these partition functions are calculated on the basis of the 
lowest translational and rotational states being zero. Therefore we can write the 
molecular partition function as 

q(V, T ) = q°(V 9 T)e D °* kT (9-54) 

where q°(V> T ) indicates that the partition function is calculated on the basis that the 
ground state of the entire molecule is taken to be zero. The factor of exp (D 0 lkT) 
“scales” the partition function to the zero of energy being the separated atoms in 
their ground states. 

Equation (9-54) can be interpreted as saying that the partition function can be 
written as the product of an internal part q°(V , T), and a scaling factor that accounts 
for the arbitrary zero of energy. Figure 9-2(a) shows the energy D 0 , and Fig. 9-2(b) 



Figure 9-2. An illustration of the arbitrary zero of energies used in Eqs. (9-54) and (9-55). (a) Shows 
the zero of energy to be the separated atoms in their ground states and (b) shows a “more 
arbitrary ” zero in which the lowest vibrational state lies e 0 ° above the zero. 
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shows a completely arbitrary zero of energy. In terms of this other zero of energy, the 
partition function is 

q(V, T) = q\V, T)<r*° 0/ * T (9-55) 

where q°(V, T) is the same partition function that occurs in Eq. (9-54). Note that if 
« 0 ° = — A> , Eq. (9-55) reduces to Eq. (9-54). In a sense, Eq. (9-55) is more arbitrary 
than Eq. (9-54) since the zero of energy has no physical basis and is completely 
arbitrary. If we substitute this into Eq. (9-45), we get 

H - e 0 ° = - kT ln( foITTo*) + kT[n p(atm) (9_56) 

Equation (9-56) clearly displays the fact that the chemical potential is calculated 
relative to some zero of energy. As T-+ 0, the molecule is found in its lowest energy 
state, which according to Fig. 9-2(b) is c 0 °* This is the significance of the zero sub- 
script on 6 0 °. It is easy to show mathematically that at 1 atm, p 6 0 ° as T -► 0 since 
T In T -> 0 as T -> 0. 

We are now ready to introduce the notation and the conventions used in most 
thermodynamic tables. We shall adopt the convention that the energy of an element is 
zero at 0°K if the element is in the physical state (i.e., gas, liquid, or solid) charac- 
teristic of 25°C and 1 atm pressure. All energies, therefore, are calculated 
relative to this convention. We shall, in addition, multiply Eq. (9-56) by Avogadro’s 
number and recognize that Np is the Gibbs free energy per mole. We shall write 
Eq. (9-56) as 

<9 - 67) 

The significance of the 0 superscript on G° indicates that this is the Gibbs free energy 
per mole at 1 atm pressure relative to the zero-of-energy convention adopted above. It 
is clear from Eq. (9-57) that G° -+ E 0 ° as T -► 0, and so E 0 ° is also the standard free 
energy at 0°K. Thus we could write G° — G 0 ° instead of G° — E 0 °. This notation is 
often found in thermodynamic tables. Furthermore, since the enthalpy H = E + pV, 
and p -► 0 as T -► 0, we also have H 0 ° = E 0 °, and one also finds the notation G° — H 0 °. 
Since G 0 ° = H 0 ° = E 0 °, the quantities G° - G 0 °, G° - £ 0 °, and G° - H 0 ° are 
equivalent. The important point is that G° — E 0 ° can be calculated without assuming 
any convention concerning the zero of energy, since this represents the standard Gibbs 
free energy relative to arbitrary zero, E 0 ° . The right-hand side of Eq. (9-57) is based 
upon the energy of a molecule in its lowest state, that is, at T — 0°K, being zero, and 
hence can be calculated for any molecule independent of the choice of E 0 °. 

Table 9-6 gives — (G° — E 0 0 )/T for a number of substances. The ratio (G° — E 0 °)IT 
rather than (G° — E 0 °) is given since (G° — E 0 °)/T varies more slowly with tempera- 
ture, and hence the tables are easier to interpolate. In addition to — (G° — E 0 °)/T, it 
is necessary to tabulate E 0 ° according to our chosen convention that the energy of an 
element is zero at 0°K if the element is in the physical state characteristic of 25°C and 
1 atm. For a molecule, E 0 ° represents the energy of a molecule at 0°K relative to the 
elements, and therefore it also represents the heat of formation at 0°K. Consequently, 
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Table 9-6. Thermodynamic functions of some selected substances 




-(C°- 

Eo°)/T cal/deg-mole 


rjrO 17 o 

ti 298 

£ 0 0 (kcal/mole) 

298.15°K 

500 K 

1000°K 

1500°K 

2000°K 

Ne 

29.98 

32.56 

36.00 

38.00 

39.44 

1.481 

0 

At 

32.01 

34.59 

38.04 

40.04 

41.48 

1.481 

0 

Kr 

34.22 

36.80 

40.24 

42.25 

43.69 

1.481 

0 

H(g) 

22.42 

24.99 

28.44 

30.45 

31.88 

1.481 

51.62 

Cl(g) 

34.43 

37.06 

40.69 

42.83 

44.34 

1.499 

28.54 

Br(g) 

36.84 

39.41 

42.85 

44.89 

46.36 

1.481 

26.90 

1(g) 

38.22 

40.78 

44.23 

46.24 

47.68 

1.481 

25.61 

<Xg) 

33.08 

35.84 

39.46 

41.54 

43.00 

1.607 

58.98 

N(g) 

31.65 

34.22 

37.66 

39.67 

41.10 

1.481 

112.54 

H 2 (g) 

24.42 

27.95 

32.74 

35.59 

37.67 

2.024 

0 

a 2 (g) 

45.93 

49.85 

55.43 

58.85 

61.34 

2.194 

0 

Br 2 (g) 

50.85 

54.99 

60.80 

64.31 

66.83 

2.325 

8.37 

I 2 (g) 

54.18 

58.46 

64.40 

67.96 

70.52 

2.148 

15.66 

0 2 (g) 

42.06 

45.68 

50.70 

53.81 

56.10 

2.07 

0 

N 2 (g) 

38.82 

42.42 

47.31 

50.28 

52.48 

2.072 

0 

CO(g) 

40.25 

43.86 

48.77 

51.78 

53.99 

2.073 

-27.202 

NO(g) 

42.98 

46.76 

51.86 

54.96 

57.24 

2.194 

21.48 

HCl(g) 

37.72 

41.31 

46.16 

49.08 

51.23 

2.065 

-22.019 

HBr(g) 

40.53 

44.12 

48.99 

51.95 

54.13 

2.067 

-8.1 

HI(g) 

42.40 

45.99 

50.90 

53.90 

56.11 

2.069 

6.7 

C0 2 (g) 

43.56 

47.67 

54.11 

58.48 

61.85 

2.238 

-93.969 

H 2 0(g) 

37.17 

41.29 

47.01 

50.60 

53.32 

2.368 

-57.107 

NH 3 (g) 

37.99 

42.28 

48.63 

53.03 

56.56 

2.37 

-9.37 

Cl 2 0(g) 

54.52 

59.49 

67.04 

71.91 

— 

2.719 

18.61 

ao 2 (g) 

52.79 

57.48 

64.65 

69.33 

— 

2.577 

25.59 

N 2 0(g) 

44.89 

49.11 

55.76 

60.27 

— 

2.291 

20.31 

N0 2 (g) 

49.19 

53.60 

60.23 

64.58 

67.88 

2.465 

8.68 

CH.(g) 

36.46 

40.75 

47.65 

52.84 

57.1 

2.397 

-15.99 

CH 3 Cl(g) 

47.45 

52.06 

59.78 

65.54 

— 

2.489 

-17.7 

CH 2 Cl 2 (g) 

55.08 

60.36 

69.58 

76.17 

— 

2.834 

-19 

CHQj(g) 

59.29 

65.81 

76.78 

84.63 


3.390 

-23 

ca 4 ( g ) 

60.15 

68.12 

81.41 

89.96 

— 

4.111 

-25 

C 2 H 6 (g) 

45.27 

50.77 

61.11 

69.46 

— 

2.856 

-16.52 

C 2 H.(g) 

43.98 

48.74 

57.29 

63.94 

69.46 

2.525 

14.52 

CH 2 0(g) 

44.25 

48.54 

55.11 

59.81 

63.58 

2.393 

-26.8 

Na 2 (g) 

46.65 

51.04 

57.14 

60.77 

— 

2.484 

— 

K 2 (g) 

51.06 

55.57 

61.76 

65.47 

— 

2.566 

— 


Source : G. N. Lewis and M. Randall, Thermodynamics , revised by K. S. Pitzer and L. Brewer (New York: 
McGraw-Hill, 1961). 


one often finds the notation AH 0 ° instead of E 0 ° . Thus we can calculate equilibrium 
constants from the tables since Eq. (9-40), is, in the notation introduced above. 


-R In K p 



(9-58) 


where A has the same significance here as in Eq. (9-41). 

We shall illustrate the use of these tables by means of examples. First let us show 
how the tables are consistent with our previous calculations. Consider the reaction 
H 2 + I 2 ^ 2HI. For hydrogen. 




= 9.95 x 10 25 V at 1000°K 
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According to Eq. (9-57), 



H ta f 


9.95 x 10 25 x 1.38 x 10 


= 32.87 


1.01 x 10 6 
at 1000°K 



in good agreement with Table 9-6. Of course, the value given in Table 9-6 must be 
considered to be more accurate than the value we have calculated here. Since hydrogen 
is a gas at 25°C and 1 atm, we find from Table 9-6 that E 0 ° = 0. For I 2 , the param- 
eters in Table 6-1 give — (G° — E 0 °)/T = 64.40 given in Table 9-6. Since I 2 is a 
solid at 25°C and 1 atm, E 0 ° is not zero, but is 15.66 kcal/mole, the heat of sublimation 
of I 2 at 0°K. If we substitute the values from Table 9-6 into Eq. (9-58), we get 
K p = 32.5, compared to 30.9 from the simpler equations of Chapter 6. 

For the reaction H 2 + \0 2 ^ ± H 2 0 at 1000°K, the tables give log K p = 10.00 com- 
pared to the experimental value of 10.06 and the less accurate value 10.2 calculated in 
the previous section on the basis of the rigid rotor-harmonic oscillator model. Note 
that K p is expressed in atmospheres, since the tables are based on the standard pressure 
being 1 atm. 

Table 9-6 also gives H 298 — E 0 °, which is the enthalpy at 298°K and 1 atm pressure 
relative to the zero of energy already explained. One often finds H 298 — if 0 °, which, of 
course, is equivalent to H 29S — E 0 °. For 1 mole of an ideal gas, 


H = E + pV = E+RT 



and so at temperatures high enough that the rotation is classical, we have 

RQ 

H T ° — E 0 ° = § RT + \RT + e@ v/T ” j + RT (diatomic molecule) 


(9-59) 


(9-60) 


= iRT + $RT + 


v R®vj 

ie^-l 


+ RT 


= iRT+iRT+Y. 

J 


R®vj 
e e VJ /T _ j 


+ RT 


(linear polyatomic molecule) 


(9-61) 

(nonlinear polyatomic molecule) 


(9-62) 

Note that there is no contribution from the zero-point vibrational energy here as there 
is in Eq. (6-22), since we have taken the energy of the ground vibrational state to be 
zero. A value of 2.07 kcal/mole for a diatomic molecule comes from the 1RT/2. 
Values higher than this are due to the vibrational contributions; a value lower than 
this (such as H 2 ) indicates that the rotation is not completely classical, and Eq. (6-35) 
should be used. 

Although entropies are not given in Table 9-6, they can be easily obtained from the 
data given there since 


= ^298 - Vj _ ^98 ~ Eo° j 


(9-63) 


The values of the entropy obtained in this way are in excellent agreement with those 
given in Tables 6-3 and 8-3. 
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Table 9-6 can also be used to calculate values of D 0 for various molecules. For 
example, consider D 0 for H 2 . Table 9-6 shows that the energy of a hydrogen atom 
relative to a hydrogen molecule (both at 0°K) is 51.62 kcal/mole. The value given in 
Table 6-1 is twice this. 

For H 2 0, we have 

H 2 + i0 2 ► H 2 0 + 57. 107 kcal at 0°K 

We add to this the following two reactions 

2H > H 2 + 103.24 kcal\ 

O ► i0 2 + 58.98 kcalj ai u ^ 

to get 

2H + 0 > H 2 0 + 219.3 kcal atO°K 

At 0°K, both reactants and products are in their ground states, and so the 219.3 kcal 
represents D 0 for H 2 0, in agreement with Table 8-1. 

As a last example, we calculate D 0 for HI. This case is slightly different since iodine 
is a solid at 25°C and 1 atm. Table 9-6 gives that 

iH 2 (g) + il 2 (s) > HI -6.7 kcal atO°K 

We add the following equations to this: 

H > iH 2 (g) + 51.62 kcal at 0 a K 

I(^) ► il 2 (s) + 25.61 kcal at 0°K 

The result is 

H + 1(g) > HI + 70.53 kcal at 0°K 

and hence D 0 for HI is 70.53 kcal/mole. 

Thermodynamic tables contain a great deal of thermodynamic and/or statistical 
thermodynamic data. Their use requires some amount of practice, but it is well worth 
the effort. Problems 9-17 through 9-26 are meant to supply this practice. 
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PROBLEMS 

9-1. Consider the reaction A^2B. The canonical ensemble partition function for an 
ideal binary mixture is 

„ NA~ NB 
#A QB 


Q(N Ai N Bi V,T) 


N a \N b \ 
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Minimize the Helmholtz free energy with the stoichiometric constraint 2N A + N B = constant 
to show that 

N b * 2 =(? b 2 
N a * “ q A 

where N A * and Nb* are the equilibrium numbers of A and B. Can you generalize this approach 
to the reaction v a A + v b B ^v c C + v p D and derive Eq. (9-7)? 

9-2. Using the data in Table 6-1 (plus atomic spectroscopic tables), calculate the equi- 
librium constant for the reaction 2 K ^K 2 at 800°K and 1000°K. The experimental values* 
at these temperatures are 0.673 and 0.123, respectively. 

9-3. Calculate the equilibrium constant K P for the reaction I 2 ^21, using the data in 
Table 6-1 and the fact that the ground electronic state of the iodine atom is 2 P 3 , 2 and that 
the first excited electronic state lies 0.94 eV higher. The experimental values! are 

T(°K) 800 900 1000 1100 1200 

K P 1.14 xlO" 2 4.74 xlO" 2 0.165 0.492 1.23 

9-4. Show that the equilibrium constant for a reaction such as 

HCI + DBr ^DCI + HBr 

approaches unity at sufficiently high temperatures. 

9-5. Calculate the equilibrium constant for the reaction N 2 14 + N 2 15 ^2N 14 N 15 and 
compare your results to the experimental values of Joris and Taylor. t 
9-6. Derive Eq. (9-24). 

9-7. Show that the Teller-Redlich product rule is valid for a diatomic molecule under the 
Born-Oppenheimer approximation. 

9-8. Calculate the enthalpy of reaction for H 2 + I 2 ^2HI around 300°K. 

9-9. Calculate the equilibrium constant for the reaction iN 2 + i0 2 ^NO, using the data 
in Table 6-1 . The observed values are 

7X°K) 1500 2000 2500 

K P 2.4 x 10" 3 1.5 x 10" 2 4.5 x 10" 2 

Why is the agreement not as good as you obtained in the other problems in this chapter? 

9-10. Calculate the equilibrium constant for the reaction C0 2 ^CO + i0 2 at 3000°K. 
The experimental value is 0.378 (atm) 1/2 . 

9-11. Using the data in Tables 6-1 and 8-1, calculate the equilibrium constant for the 
water gas reaction C0 2 + H 2 ^CO + H z O at 900°K and 1200°K. The experimental values 
at these two temperatures are 0.46 and 1 .37, respectively. 

9-12. Usfhg the data in Tables 6-1 and 8-1, calculate the equilibrium constant of the 
reaction 3H 2 + H 2 ^2NH 3 at 400°C. The accepted value is 3.3 x 10 “ 4 (atm) -2 . 

9-13. Calculate the temperature at which molecular nitrogen is 99 percent dissociated at 
100 atm, 1 atm, and 0.1 atm. 

9-14. For the two ionization processes 

H^H + +e" 

Cs ^Cs + + e~ 

derive an expression for the ratio of the equilibrium constants K B /K C 3 where 


Kh = PH ±£lz and * c .= pc * +p ‘- 


P H 


PCs 


* See C. T. Ewing, et al. y J. Chem . Phys.> 71, p. 473, 1967. 
t See Perlman and Rollefson, J. Chem. Phys. f 9, p. 362, 1941. 
j See G. C. Joris and H. S. Taylor, J. Chem. Phys. 9 7, p. 893, 1939. 
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in terms of the respective ionization potentials 1 H and I c * . What is the high-temperature limit 
of K H /Kc* ? At room temperature do you expect KhIK c * to be greater than, less than, or 
approximately equal to one? 

9-15. Calculate the equilibrium constant for the first ionization of argon at 10,000°K. 
Use the data in A. B. Cambel, Plasma Physics and Magnetofluid Dynamics (New York: 
McGraw-Hill, 1963, p. 119). 

9-16. Estimate the temperature at which gaseous atomic hydrogen would be at least 
99 percent dissociated at 10“ 3 torr pressure into electrons and protons. Assume classical 
statistics, but remember that the electron has a spin degeneracy of 2. 

9-17. Calculate the equilibrium constants of Problem 9-2, using the data in Table 9-6 
and the Handbook of Chemistry and Physics. 

9-18. Calculate the equilibrium constants of Problem 9-8, using the data in Table 9-6. 

9-19. Calculate the equilibrium constants of Problem 9-9, using the data in Table 9-6. 

9-20. Calculate the equilibrium constants of Problem 9-10, using the data in Table 9-6. 

9-21. Calculate the equilibrium constants of Problem 9-11, using the data in Table 9-6. 

9-22. Calculate the equilibrium constants of Problem 9-12, using the data in Table 9-6. 

9-23. Verify the results in Table 9-6 for Ne, using the results of Chapter 5. 

9-24. Verify the results in Table 9-6 for HBr, using the results of Chapter 6. 

9-25. Verify the results of Table 9-6 for CH 3 CI, using the results of Chapter 8. 

9-26. Verify the results of Table 9-6 for N0 2 , using the results of Chapter 8. 
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QUANTUM STATISTICS 


In Chapter 4 we derived the two fundamental distribution laws of statistical mechanics. 
One, the Fermi-Dirac distribution, applies to systems whose /V-body wave function is 
antisymmetric with respect to an interchange of any two identical particles; the other, 
Bose-Einstein statistics, applies to systems whose N-body wave function is symmetric 
under such an interchange. AH elementary particles that have a half-odd-integral 
spin, such as the electron and the proton, obey Fermi-Dirac statistics and are called 
fermions; elementary particles that have an integral spin, such as the deuteron and 
the photon, obey Bose-Einstein statistics and are called bosons. 'The classification of 
compound particles can become a delicate problem at times, but as long as the binding 
energy of the compound particle is large compared to all other energies in the problem 
(which will be so for all cases that we shall discuss), one can say that a compound 
particle containing an odd number of fermions, such as He-3, will obey Fermi-Dirac 
statistics, and one with an even number, such as He-4, will obey Bose-Einstein statistics. 
Incidentally, it should be pointed out that it is not obvious that all particles occurring 
in nature are necessarily fermions or bosons, but there appear to be no known 
exceptions. 

The basic equations associated with the two fundamental distribution laws are 
[Eqs. (4-24) through (4-28)] 

S(F, T, A) = n (1 ± Ae-**)* 1 (1 0-1 ) 

k 

(,<w ’ 

(10 - 3) 


k 


Xe k e~^ 

1 ± Xe~ fiCk 


(10-4) 
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and 

pV = ±kT'£ ln(l ± Ae - "*) (10-5) 

k 

where A = exp(p kT). I A these equations the upper sign (+) corresponds to Fermi- 
Dirac statistics, and the lower sign (— ) corresponds to Bose-Einstein statistics. In 
order to discuss the thermodynamic properties given by these equations, it is necessary 
to solve Eq. (10-2) for k in terms of N and the /Je’s, and since the e k ’s are functions 
of F, this procedure gives A as a function of N , F, and T This solution for k is then 
substituted into Eqs. (10-4) and (10-5) to give E and p 9 and hence other thermo- 
dynamic functions, in terms of N , F, and T . (See Problem 10-1.) 

The difficulty is that it is not possible to solve Eq. (10-2) analytically for all values 
of k. In Chapter 4 we showed that if k is small, both Fermi-Dirac and Bose-Einstein 
reduce to Boltzmann, or classical, statistics. In this case k = N q [cf. Eq. (10-2)], and 
all the mathematical manipulations are easy to perform. Thus the magnitude of k 
can be regarded as a measure of the degree of quantum behavior of the system. Small 
values of k correspond to classical or near-classical behavior, and large values of k 
apparently correspond to quantum statistical behavior. Table 4-1 and the discussion 
surrounding it show that quantum effects become important for low temperatures and 
high densities. Thus we may expect A to be small for high temperatures and low 
densities and large for low temperatures and high densities. We shall verify this later 
in the chapter. 

Fortunately most systems of interest can be described by Boltzmann statistics and 
hence are characterized by small values of A. This led us to the detailed discussion of 
classical ideal gases in Chapters 5 to 9. 

There are several important and interesting systems, however (see Table 4-1), which 
cannot be described by classical statistics, and hence for which A is not small. In this 
chapter we shall discuss these applications. 

In Section 10- 1 we shall study an ideal Fermi-Dirac gas for values of A such that 
a series expansion of Eqs. (10-2) through (10-5) is useful. The first terms of these 
expansions represent the Boltzmann, or classical, limit, and so these expansions are 
useful in a temperature and density region where there are only small deviations from 
classical behavior, that is, high temperature and low density. We shall find the interest- 
ing result that even though we neglect the intermolecular forces in an ideal gas model, 
the equation of state will no longer be p = pkT, but will be a virial expansion, that is, 
a power series in the density. In Section 10-2 we shall examine the case for large values 
of A (low values of the temperature or high values of the density). In this region we 
expect large deviations from classical behavior. Such systems are said to have large 
quantum effects or to be strongly degenerate . We shall see that although the results 
of these two sections are derived for an ideal gas, they serve as an interesting model for 
the electrons in a metal, and, in fact. Section 10-2 considers this model specifically. 

Then in Section 10-3 we shall treat Bose-Einstein statistics in the range of A, where 
a series expansion of Eqs. (10-2) through (10-5) is useful. The results are very similar 
to those of Section 10— I. In Section 10-4 we examine the low- temperature or high- 
density limit of Eqs. (10-2) through (10-5). This section, although more mathematically 
involved than the others, is very interesting, since we find that a system of ideal bosons 
undergoes a kind of phase transition as the temperature is lowered. We shall discuss 
the implications of this result with respect to the well-known A-transition that occurs 
in He-4 at 2.18°K. In Section 10-5 we shall treat an ideal gas of photons or, in reality, 
electromagnetic radiation in thermal equilibrium. In this section we shall derive the 
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fundamental blackbody radiation distribution law first derived by Planck and which 
led to the development of quantum mechanics. 

Section 10-6 is devoted to an alternative formulation of quantum statistics, namely, 
through the density matrix, which is the quantum statistical analog of the density of 
phase points in classical phase space. Lastly, Section 10-7 is a rigorous discussion of 
the classical limit of Q . 


10-1 A WEAKLY DEGENERATE IDEAL FERMI-DIRAC GAS 


As the title of this section indicates, we shall consider an ideal gas of fermions in a 
region where X is small enough that we may represent the deviations from classical 
behavior by a series expansion in X . Consider Eqs. (10-2) and (10-4) with the upper 
signs: 

_ Xe~ fitk 


p V = kT'£ ln(l +Xe- p * k ) 


(10-7) 


The e fc ’s in these equations are the eigenvalues of a particle in a cube [c/. Eq. (5-3)], 
and the one index k really stands for n X9 n y9 and n z : 

h 2 


g w p/ 2 /3 ^ n x + n y + n z ) n xi n y> n z 2, 


(10-8) 


where we have replaced a 2 by V 2/3 . In Chapter 5 we converted sums over the energy 
states to integrals over energy levels. We used the argument that at room temperature, 
successive values of elkT differed so little from each other that e k (e„ x „ ynx ) was essen- 
tially a continuous function. We derived an expression for the density of states co(e) 
between e and e + de on the same basis. In this chapter we do not wish to restrict our 
discussion to high temperatures, but the very same type of argument applies if we 
consider the thermodynamic limit of a large volume [cf. Eq. (10-8)]. Thus we may 
consider the energy states in Eq. (10-8) to be continuous and write 


12 m\ 3/1 r 00 Xe 1/2 e de 

/2m\ 3/2 r°° 

pV = InkTi-j) V I e 1/2 ln(l + Xe~ fit ) de 


(10-9) 

(10-10) 


This procedure is valid only because the summands in Eqs. (10-6) and (10-7) are 
finite continuous functions of X for all the e k . Equations (10-6) and (10-7) indicate 
that the range of X is 0 <; X < oo. Because of the minus sign in the denominator of the 
expressions for Bose-Einstein statistics, such a conversion of a summation into an 
integration must be done with care. 

The integrals occurring in Eqs. (10-9) and (10-10) cannot be evaluated in closed 
form, but they can be written as power series in X by expanding the denominator of 
Eq. (10-9) and the logarithm in Eq. (10-10) in a power series in X and integrating term 
by term (see Problem 10-4): 

1 »(— 1 ) ,+1 A' 

P ~A 3 ,k l 3 ' 2 


(10-11) 
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pis (-1 y +i x i 

kT A 3 ,tl / 5/2 


(10-12) 


These series are valid only for small values of X. Equation (10-1 1) gives p as a power 
series in A, and Eq. (10-12) gives p; kT as a power series in X. We wish to solve 
Eq. (10-1 1) for A as a function of p, and then substitute this into Eq. (10-12) to give 
pjkT as a function of p. 

The problem of solving an equation like Eq. (10-11) is called reversion of a series 
and can be done in general. The general solution is involved, however, and since we 
are interested only in small deviations from classical behavior in this section, we can 
use a straightforward algebraic method that readily gives the first few terms. We 
assume that X is a power series in p and write 

X = Oq + fljP 4- #2 p 2 "h ’ ’ 

We then substitute this into Eq. (10-11) and equate coefficients of like powers of p 
on both sides of the equation to get 


a 0 =0 
a x — A 3 


*2-|^ = 0 

*1*2 , fl l 3 _ n 
a 3 2 1/ 2 + 3 3 / 2 — u 


Thus A is 


A = pA 3 + 


2 3 / 2 


(pA 3 ) 2 + ^ - ^(pA 3 ) : 


(10-13) 


(10-14) 


We now substitute this into Eq. (10-12) and get 


P A 3 

kf~ P + 2 s > 2 P 


2 + ( l “ 3 ^)' 


AV + 


(10-15) 


This equation is in the form of a virial expansion for the pressure, which is custom- 
arily written in the form 


= P + B 2 (T)p 2 4- B 3 (T)p> + 


(10-16) 


where Bj(T) is a function of only the temperature and is called the jth virial coefficient. 
Virial coefficients reflect deviations from ideality, or intermolecular interactions. 
Although there are no intermolecular forces in this case, the particles, nevertheless, 
experience an effective interaction through the symmetry requirement of the N-body 
wave function. In the case of fermions, this interaction may be said to be repulsive 
since the first correction to ideal behavior, B 2 (T), is positive and hence increases the 
pressure above that of a classical ideal gas under the same conditions. Note also that 
B 2 (T) is 0(A 3 ). But A is just the thermal De Broglie wavelength, and so we see that 
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quantum statistical effects decrease as the thermal De Broglie wavelength decreases 
In fact, if Eq. (10-15) is written in a dimensionless form, we see once again that it is 
the dimensionless ratio A 3 /V that is a measure of the quantum effects. 

The Fermi-Dirac equation for E, Eq. (10-4) with the positive sign, also can be con- 
verted to an integral and then written as a series expansion in A: 

1 oo (—Y\ l + 1 l l 

E = i ^ T A 5 I ? 1 ' / ? /2 -- o O- 1 7) 


If Eq. (10-14) for A is substituted into this, we get 


E = iNkT 


( 1 +^2P + "-) 


( 10 - 18 ) 


All other thermodynamic functions can be obtained from Eqs. (10-14), (10-15), and 
(10-16). For example, the chemical potential p follows immediately from Eq. (10-14) 
since A = exp (ii/kT). The entropy can be obtained from G = iiN = E — TS + pV 
These series expansions for the thermodynamic functions are valid only for small 
values of A, or for temperatures and densities such that quantum effects are a small 
correction to the classical limit. The leading terms in all these expansions are the 
classical limits obtained in Chapter 5. Now we shall discuss the case for large values 
of A. 


10-2 A STRONGLY DEGENERATE IDEAL FERMI-DIRAC GAS 

In this section we shall treat the case of an ideal Fermi-Dirac gas at low temperature 
and/or high density. For concreteness we shall develop the results in terms of the free 
electron model of metals, where the valence electrons of the atoms of the metal are 
represented by an ideal gas of electrons. It is possible to understand a number of 
important physical properties of some metals, in particular the simple monovalent 
metals, in terms of the free electron model. There are several reasons why such a 
simple model can be used. One is that although the electrons do indeed interact with 
each other and with the atomic cores through a Coulombic potential, this potential 
is so long range (1/r) that the total electronic potential that any one electron “sees” 
is almost constant from point to point in the metallic crystal. Furthermore, in general 
many of the physically observable properties of a metallic crystal are due more to 
quantum statistical effects than to the details of the electron-electron and electron- 
ionic core interactions. 

Consider Eq. (10-3) (with the positive sign in the denominator), which gives the 
number of particles in the molecular energy state k : 

Ae“** 1 

"* ~ 1 + ~ 1 + ( 10 - 19 ) 

As in the previous section, e k is essentially a continuous parameter, and we can write 

m = r+ - ^ ( — ,-) do-20) 

where /(e) is the probability that a given state is occupied. This equation is plotted in 
Fig. 10-1, where, in particular, /(e) is plotted versus efoi for fixed values of /fyi. In the 
extreme limit of low temperatures, that is, T = 0, the distribution is unity for energies 
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Figure 10-1. The Fermi-Dirac distribution as a function of e (in units of jx) for T = 0 and fin = 0.10. 


less than and zero for energies greater than fi . The quantity fi is, in general, a func- 
tion of temperature, and so we indicate the value of fi at T = 0 by fi 0 . 

The zero-temperature limit of /(e) is very simple, and it is instructive to understand 
its behavior. At the absolute zero of temperature, all the states with energy less than 
/z 0 are occupied and those with energy greater than fi 0 unoccupied. Thus fi 0 has the 
property of being a cutoff energy. According to Eq. (1-35), the number of states with 
energy between e and e 4- de is 


co(e) de 


2m\ 3/2 


-a?) 


Ve if2 de 


( 10 - 21 ) 


where we have introduced a factor of 2, since an electron has two spin states (± 1) 
associated with each translational state. We can find fi 0 immediately from the fact that 
all the states below e = fi 0 are occupied and all these above are unoccupied. Thus if 
N is the number of valence electrons. 


N - V Jc 

from which we write 

h 2 / 3 \ 2/3 /N\ 2/3 

- = 2^U Ik] 


(10-22) 


(10-23) 


Using the fact that the molar volume of Na, say, is 23.7 cm 3 mole, and that a sodium 
atom has one valence electron, the quantity fi 0 is 3.1 eV. We see, then, that at T = 0°K, 
the conduction electrons in sodium metal fill all the energy states up to 3.1 eV accord- 
ing to the Pauli exclusion principle of allowing only two electrons to occupy each state. 
The quantity fi 0 is called the Fermi energy of a metal and is typically of the order of 
1-5 eV (see Problem 10-6). 

This is a very significant result since a plot as in Fig. 10-1 shows that at room 
temperature, where f}fi 0 is of order of 10 2 , the distribution /(e) is still essentially a step 
function like at T = 0°K. Compared to a characteristic temperature fi^k^ room tem- 
perature may be considered to be zero, and it is an excellent first approximation to use 
the distribution 


/(e) =1 e < fi 0 

= 0 e > fi 0 


(10-24) 
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at room temperature. The quantity fi 0 .k is called the Fermi temperature and is denoted 
by T f . Fermi temperatures are typically of the order of thousands of degrees Kelvin. 
In this approximation, the thermodynamic energy is [cf. Eq. (10-4)] 

*) v i * * 

= l^o (1 0-25) 

where we have written E 0 to emphasize that this is a T — O' K result. The energy E 0 
is the zero-point energy of a Fermi-Dirac gas. Equation (10-25) implies that the 
contribution of the conduction electrons to the heat capacity is zero, in sharp contrast 
to the equipartition value of 3k;2 for each electron. The physical explanation of this 
is that in order to contribute to the heat capacity, the electrons must be excited to 
higher quantum states, but since fi 0 is so large compared to kT, only a small fraction 
of all the particles will be within kT from the top of the distribution (near fi 0 ), where 
there are vacant states lying above. Thus a very small fraction of all the electrons can 
contribute to the heat capacity, and so the experimental heat capacity is almost zero. 

The pressure is given by [cf. Eq. (10-10) with the additional factor of 2 due to the 
two allowed spin orientations of an electron]: 

( 2m\ 

-~ 2 j j £ 1/2 ln(l -I- ^°" £) ) de 

In order to evaluate this integral, one can neglect unity compared to eP iuo ~ z) since 
~ e ) is much larger than one over most of the range of integration. Therefore 

Po 

s 1/2 (Mo - e) de 

o 

= iNfi 0 IV (10-26) 

This “ zero-point pressure ” is 0(10 6 ) atm. The occurrence of h in this expression shows 
that the zero-point pressure is a quantum effect. It follows from G = Njx = E — TS + pV 
and Eqs. (10-23), (10-25), and (10-26) that S 0 = 0. This is to be expected, since there 
is only one way to put the N indistinguishable particles into the lowest possible 
quantum states. 

Equations (10-22) through (10-26) are based on the zero-temperature distribution 
function, Eq. (10-24). It is not difficult to calculate corrections to these zero-tempera- 
ture results as an expansion in powers of a parameter r\ = kT/fi 0 . At room tempera- 
ture, rj = 0(10“ 2 ), so that such an expansion converges quickly. To determine this 
expansion, first notice that all the thermodynamic quantities N, E , /?, can be 
written as 

r 00 

/= f(s)h(e)de (10-27) 

J 0 

where the following table gives examples of I and h(e ) : 
h(e) 

H 4 
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Figure 10-2. The Fermi-Dirac distribution function and its derivative for a value of jfyx - - 0.10. 


Figure 10-2 shows /(e) and the derivative of /(e), /'(e), versus e for fifi = 0.10. This 
shows that even at fifi = 0.10 (which, we shall show below, corresponds to a tempera- 
ture at which most metals are liquid), /(e) is a step function with rounded edges, and 
its derivative therefore is a function that is zero everywhere except around the region 
in which e = fi. Thus it is convenient to express the integral I in Eq. (10-27) in terms 
of /'(e). This is done by an integration by parts: 

/ = - f D °/'(e)ff(e) de (1 0-28) 

J o 

where 

H(e)= f h(x) dx (10-29) 

J o 

and we have used the fact that h(e) = 0 at e = 0. Now since /'(e) is nonzero only for 
some small region around e = fi, the only important values of //(e) will be for values 
of e around e = fi. Thus we expand //(e) in a Taylor series about e = fi: 

H(e) = H(/i) + (6 - + i(e - _ +•" (10-30) 

If we substitute this into Eq. (10-28), we get 
(dH\ 1 /d 2 H\ 

f = h(m) l o + (*:) .,/■ +~ 2 

where 


Lj = - f (e - nW'(fi) de 

J o 


(10-32) 


The first of these integrals, L 0 , is simply unity since it is equal to /( 0) — /( oo). In the 
others, such as L x and L 2 , we may replace the lower limit by — oo since /'(e) is so 
small from — oo to 0 that there is no contribution to the integral. If we let x = /?(e — fi), 
we have 


•=-f 

'' R J J_ 


x*e* 


x\2 


^^(l+e*) 


dx 7 = 0, 1,2, .. 


(10-33) 


All of these integrals with odd values of j vanish, since except for the x j , the integrands 
are even functions of x. (See Problem 10-10.) The integral for L 2 is standard and may 
be found in tables: 


00 x 2 e x dx _ 7i 2 
-oo (1 + e*) 2 = T" 



168 QUANTUM STATISTICS 


This gives L 2 = n\kT) 2 3 and, by Eq. (10-31), 
/ = H(») + j (kT) 2 H"(fi) + 


(10-34) 


As an example of the use of this equation, we calculate N. In this case h(e) = 
4n(2m h 2 ) 3 ? 2 Ve il2 , and so 


*=f(f?) iv»[i +!■«>-+•••] 

Using Eq. (10-23) for p 0 , this becomes 

r n 2 1 2/3 

tt, = ^l+-(fr i )- 2 + ---j 


(10-35) 


= M 1 +^(^) 2 + -"] 


This gives n 0 ;n as a power series in We can get as a power series in 

1 1 = (/S/io) -1 by taking the reciprocal of this equation 


jt 


-=i-^(fc0- 2 + - 

M 0 12 

and then substituting n = ^(1 — jt 2 /12(jfyt) 2 + ■ • •) into the right-hand side to get 




(10-36) 


This equation shows that fi changes slowly with temperature and is approximately 
fi 0 throughout the entire solid-state range of a metal. 

We can use Eq. (10-34) to calculate other thermodynamic properties such as E. 
This gives 


E= f(S ) 3/2 ^ 5/2[1 + l(M ~ 2 + ' ‘ 

= £o(^) 5/2 [ I + |* 2 (fci)- 2 + -] 


We can use Eq. (10-36) now to write E as a power series in t] : 
5n 2 , 1 


= Eo 


1 4 


12 


r + ■■■ 


(10-37) 


The contribution of the conductance electrons to the heat capacity is 


c K 


n 2 NkT _ n 2 / T\ 

2ifi 0 /k) ~ T \7>j 


(10-38) 


where we have introduced the Fermi temperature T F . This equation predicts the molar 
electronic heat capacity of metals to be of the order of 10 -4 T cal/deg-mole, and this is 
observed for many, but not all, metals to which the free electron model is expected to 
be applicable. 
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Fermi-Dirac statistics have been applied to other physical systems besides the elec- 
trons in metals. Two such applications are the theory of white dwarf stars and 
“ nuclear ” gases. 


10-3 A WEAKLY DEGENERATE IDEAL BOSE-EINSTEIN GAS 


In this section we shall treat an ideal Bose-Einstein gas in a region where X is small 
enough that we may use series expansions in X as we did in Section 10-1 for an ideal 
gas of fermions. We shall see that the results of this section are very similar to those of 
Section 10-1. In this case we use Eqs. (10-2) and (10-4) with the lower signs: 


„ Xe- fi£ * 
kt y 

jtil-Xe-^) 

pV= -fcT £ln(l -Xe-t**) 


(10-39) 

(10-40) 


The energy values here are also given by Eq. (10-8). 

In Section 10-1 we converted these summations to integrals, since for large V the 
e k ’s vary essentially continuously. The discussion following Eq. (10-10) states that 
such a procedure is valid only because the summands are finite continuous functions 
of X for all e k . If we let e 0 be the ground translational energy state, we see from 
Eq. (10-39) that X is restricted to the values 0 < X < e Qe °. Thus if we wish our results to 
be valid for all values of X , we must take care in converting the summations in 
Eqs. (10-39) and (10-40) to integrals. In particular, we must single out the ground 
state and write 




Xe~ p£ ° Xe'^ 

1 - Xe-^ + 1 - te~ pek 


(10-41) 


for Eq. (10-39), where the k ^ 0 means a summation over all states other than the 
ground state. Now since 0 < X < e Pe °, and e k > e 0 for all k ^ 0, the summation here 
can be converted to an integral by the same method that we used in Section 10-1. 
Equation (10-41) becomes 


Xe~ fi£0 
1 - Xe~ fieo 


2 m\ 3f2 / r 00 Xe 1/2 e de 

J?) V l >£0 1-Xe-t* 


(10-42) 


According to Eq. (10-8), the ground state e 0 = 3h 2 f 8mV 2/3 , but we can always 
redefine the energy such that e 0 = 0. Physical results must be independent of where 
one sets the zero of energy of a system, and the following equations are simpler if we 
choose the energy such that e 0 = 0. Equation (10-42) becomes 


N 12 m\ 3/2 r 00 Xe i,2 e fie de X 

P=1 V = 2n W) -Lo 1 -Ae“* + V(\ - A) 


(10-43) 


Similarly, Eq. (10-40) is 

= _ 27 t ( fr ) J >o £ ' /2 1”(1 “ ke ~ fC ) ln (l “ *) ( 10 - 44 ) 

where in both Eqs. (10-43) and (10-44), 0 <, X < 1. 
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The second terms of both equations contain a factor of l/V. Ordinarily it is legiti- 
mate to ignore such terms, since we are always interested only in the thermodynamic 
limit, where V -* cc. We must be careful here, however, since if A-»l, the term 
A/( 1 — X) becomes very large, and X/( 1 — X) divided by V is not necessarily negligible 
compared to the first term in Eq. (10-43). In this section we shall consider only the 
case where X is close to zero, and so the l/V terms in Eqs. (10-43) and (10-44) are 
not important. But in the next section we shall consider the strongly degenerate case 
in which 1, and we shall see that in this case the l/V terms contribute in an 
important way to the thermodynamic functions. 

The integrals occurring in Eqs. (10-43) and (10-44) can be evaluated as power 
series in X in much the same way as Eqs. (10-9) and (10-10). The result is 


P = 


^ 3 ^ 3 / 2 ^) 


and 


_P_ 

kT 


^3 05 / 20 *) 


where 

GO 

M = E Tn 
1=1 l 


(10-45) 


(1 0-46) 


(10-47) 


Equation (10-45) gives p as a power series in X, and Eq. (10-46) gives pjkT as a power 
series in X. We can derive an expression for pjkT as an expansion in the density by 
reverting Eq. (10-45) and substituting this result into Eq. (10-46): 


P 

pkT 


= 1 - 


A 3 

2 5 ' 2 P + "‘ 


(10-48) 


The second virial coefficient in this case is negative, implying that the effective inter- 
action between ideal bosons is attractive, in contrast to the case for fermions. 

The Bose-Einstein equation for E, Eq. (10-4) with the negative sign, can be con- 
verted to an integral and then written as a series expansion in X: 

E = iVkTXg 5 l 2 (X) 


In terms of p, this becomes 


E = 


^NkT^l - 



(10-49) 


All other thermodynamic functions for a weakly degenerate ideal gas of bosons 
follow in a similar way. Just as in Section 10-1, these virial expansion expressions for 
the thermodynamic functions are useful only for small values of X or p and represent 
small quantum corrections to the limiting classical results. In the next section we shall 
discuss the region of strong degeneracy, where X approaches its largest allowed value 
(X -¥ 1), and the l/V terms in Eqs. (10-43) and (10-44) cannot be ignored. 
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10-4 A STRONGLY DEGENERATE IDEAL BOSE-EINSTEIN GAS 

We now consider the situation when X is not necessarily small. Let us return to 
Eqs. (10-43) and (10-44) for the density and pressure of an ideal Bose-Einstein gas 


„ 12 m\ 3/2 f “ XE l/2 e~ t/kT J 

P ~ 2n \l 7) J 0 1 - Xe~ c/kT d£ + 


V(l - X) 

J o e ' /2 ,n (! “ Xe~ t/kT ) de-^ ln(l - X) 


or 


+ (10-50) 

jj-jirf-pllld-l) (10-51) 

where # 3/2 (A) and# 5/2 (A) are defined by Eq. (10-47). Since X can approach unity, we 
cannot ignore the 1 jV terms in these equations. 

Equation (10-3) shows that the average number of particles in their ground state is 

X 


and so it is clear that 0 <; X < 1. Note that there is no such restriction on the range of 
X in the Fermi-Dirac case. There 0 < X < oo. The function # 3/2 (A) in Eq. (10-50) is a 
bounded, positive, and monotonically increasing function of X in the range 0 < X < 1. 
At X = 1 the first derivative diverges, but <? 3/2 itself is finite since 

*3/2(1) = t jm = C(D = 2.612 .. . (1 0-52) 

where C(n) is the Riemann zeta function, defined by (see Problem 1-63) 

C(«)=£i (10-53) 

1=1 l 

The function g 3j2 ( A) is plotted in Fig. 10-3. 


£3/2 M 



x 

Figure 10-3. The function p 3/2 (A) versus A. 
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Figure 10-4. The graphical solution of Eq. (10-50) for A as a function of p and T. The dotted lines are 
03 / 2 (A) and A 3 A/F (1 — A). The solidlines are pA 3 = constant and# 3 / 2 (A) 4 - A 3 A/F (1 — A). 

In order to determine the equation of state, we must determine A as a function of p 
and T by solving Eq. (10-50) for A, and then substituting this into Eq. (10-51) for 
p l kT Equation (10-50) cannot be solved analytically, but can be solved graphically 
by plotting both sides of the equation on the same graph and picking off the inter- 
section. In particular, we shall plot pA 3 and p 3/2 (A) 4-A 3 A/F(l — A) versus A. (See 
Fig. 10-4.) For fixed values of p and T , the function pA 3 is just a constant and so 
appears as a horizontal line. The function p 3/2 (A) 4- A 3 A/F(1 — A), on the other hand, 
deserves some thought. Since V is large, the term A 3 AT(1 — A) is small for all values 
of A except those where A is very close to unity. In fact, this term is negligible compared 
to p 3/2 (A) except when A = 1 — 0(1 V ), say 1 — aj V, where a is some positive number 
(remember that A must be less than 1). Thus a graph of p 3/2 (A) 4- A 3 A/F(1 — A) is 
indistinguishable from a graph of p 3/2 (A) everywhere except where A« 1. This sum 
p 3/2 (A) 4- A 3 A/F(1 — A), along with its two contributions, is plotted in Fig. 10-4. The 
dashes are the two separate terms, and the solid line is their sum. Note that the value 
of A given by the intersection of the curves pA 3 = constant and p 3/2 (A) 4- A 3 A/F(1 — A), 
and the value of A given by the intersection of pA 3 = constant and p 3/2 (A) differ by 
only 0(1 /F). This is so since p 3/2 (A) and p 3/2 (A) 4- A 3 A/F(1 — A) differ only when 
A = 1 — 0(1/F) in the first place. 

Figure 10-5 shows the set of intersections obtained by varying pA 3 , which may be 
done by varying the density at constant temperature or by varying the temperature at 
constant density. Note that above pA 3 = 2.612, A is essentially equal to unity, being 
different from unity by 0(1/F). This may be immediately seen by noting that the solid 
curve in Fig. 10-4 lies very close to the line A = 1 for values of pA 3 greater than 2.612. 
The point pA 3 = 2.612 represents some sort of a critical point as we shall see below. 

We can summarize the results of Figs. 10-4 and 10-5 mathematically in the following 
way. We wish to solve Eq. (10-50) for A as a function of p and T, that is, we wish to 
solve 

A 3 A 

pA 3 = g V2 (k) + — 


(10-54) 
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Figure 10-5. A plot of A versus A p 3 , obtained from the locus of Intercepts of plots such as the one in 
Figure 10-4. 


for A. Figures 10-4 and 10-5 show that for values of pA 3 less than 2.612, A is given to 
within 0(1, V) by the root of p 3/2 (A) = pA 3 = constant. For values of pA 3 greater than 
2.612, on the other hand, A = 1 — 0(1/F) = 1 — a!V, where a is some positive number. 
We can determine a by substituting A = 1 — a\V into Eq. (10-54) and solving for a: 


A 3 

pA 3 -g v 2 (I) 


(pA 3 > 2.612) 


(10-55) 


where we have written p 3/2 (l) for p 3/2 (l — 0(1/F)) since p 3/2 (A) is a continuous func- 
tion of A. Thus we have 


^ 1 “* y ^ pA 3 > P 3 / 2 O) 

= the root of p 3/2 (A) = pA 3 if pA 3 < p 3/2 (l) (10-56) 

where a is given by Eq. (10-55). In the thermodynamic limit V 00 , 

A = 1 if pA 3 > p 3/2 (l) 

= the root of p 3/2 (A) = pA 3 if pA 3 < p 3/2 (l) (10-57) 

Clearly the point pA 3 = p 3/2 (l) = 2.612 is a special point. To explore its physical 
significance, consider pA 3 = p(h 2 j2nmkT ) 3/2 to be a function of temperature for a 
fixed density. 

At high temperatures such that pA 3 < 2.612, A must be determined numerically 
from the equation p 3/2 (A) = pA 3 . But at low temperatures such that pA 3 > 2.612, 
A = 1 — ajV where a is given by Eq. (10-55). The quantity A/(l — A) is the average 
number of particles in their ground state, and for temperatures such that pA 3 > 2.612, 
we have 

«o = JZTJ = \ = X5 (P A3 - 03/2(1)) (pA 3 > 2.612) (1 0-58) 


We can write this in a more instructive form by defining a temperature T 0 by 

.2 X 3/2 


pA ° 3 " P {^nkf) ~ 93l2(l) 

In terms of this T 0 then, Eq. (10-58) becomes 

fo = 1 _/n 3/2 - - 

N \T 0 ) 


(10-59) 


T< T 0 


(10-60) 
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For temperatures greater than T 0 , the value of A determined from p 3/2 (A) = pA 3 will 
not be in the neighborhood of A = 1 (unless T = T 0 4- 0(1 V), which is, of course, of 
no practical interest). Let this value of A be denoted by A 0 . Then n 0 = A 0 '(1 — A 0 ) and 
so n 0 N would be vanishingly small. Thus, at temperatures above T 0 , 

5 = 0 T>T 0 (10-61) 

Equations (10-60) and (10-61) for n 0 N are plotted in Fig. 10-6. 

It can be seen that when T > T 0 , the fraction of molecules in their ground state is 
essentially zero. This is the normal situation, where the molecules are distributed 
smoothly over the many molecular quantum states available to each one. However, as 
the temperature is lowered past T 0 , suddenly the ground state begins to be appreci- 
ably populated, and the population increases until at T = 0 all the molecules are in 
their ground state. The fact that one state (the ground state) out of the many available 
to each molecule starts to become greatly preferred abruptly at T = T 0 is analogous 
to an ordinary phase transition. This “condensation” of the molecules into their 
ground states is called Bose-Einstein condensation. 

A similar result is found if the temperature is held fixed and the density is allowed 
to vary. There is a critical density, above which the ground states of the molecules are 
not preferred, and below which the molecules tend to populate their ground state. The 
analog of Eqs. (10-60) and (10-61) is (see Problem 10-36) 


= 1 Po 

N p 

= 0 


P >Po 
P<Po 


(10-62) 


To determine the properties of a Bose-Einstein condensation, consider the equation 
of state. From Eq. (10-51), the pressure is given by 


_P_ 

kT 


= ^ 5 / 2 a) - -^ln(l - A) 


(10-63) 


We may neglect the logarithm term here, since its greatest value is achieved when 
1 — A = ajV. In this case the second term is (\IV)\n{\jV) 9 and this vanishes as V -> oo 
since x In x -* 0 as x -* 0. Thus Eq. (10-63) is simply 

< 10 - 64 ) 


where A is determined in terms of p and T by the arguments surrounding Figs. 10-4 and 
10-5, which are summarized in Eq. (10-56). 



T 

Figure 10-6. The fraction of particles in their ground state as a function of temperature. 
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If we consider p k T to be a function of the density for fixed temperature, we can 
write 


_P_ 

kT 


■^ 3 ^/ 2 ^ 

= ^3 #5/2(0 


P <Po 

P>Po 


(10-65) 


where g 5f2 ( 1) = £01) = 1.342 The crucial point is that for p > p 0 , p\kT is inde- 
pendent of the density and hence appears as a horizontal line when p kT is plotted 
versus p. For p < p 0 , on the other hand, pjkT is a function of p that can be deter- 
mined numerically. 

It is customary to plot isotherms of p versus v rather than p versus p, and these are 
shown in Fig. 10-7. Note that these isotherms are very similar to the isotherms observed 
for real gases. 

The horizontal lines represent that region in which the system is a mixture of two 
phases. The points A and B correspond to the two phases in equilibrium: the con- 
densed phase {A) and the dilute phase ( B ). The dilute phase has a specific volume v 0 , 
and the condensed phase has a specific volume 0. The two phases have the same 
pressure, namely, the vapor pressure, which is given by 


Po(T) — 


(10-66) 


If we differentiate this and compare it to the Clapeyron equation, 

dp Aif cond 
dT T AV 


(10-67) 



Figure 10-7. The pressure-volume isotherms for an ideal Bose-Einstein gas. The points A and B cor- 
respond to the two phases in equilibrium. 
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we see that there is a heat of transition associated with this process that is given by 
(see Problem 10-20) 

0 °- 68 ) 

03 / 2 ( 1 ) 

Therefore the Bose-Einstein condensation is a first-order process. This is a very unusual 
first-order transition, however, since the condensed phase has no volume, and the 
system therefore has a uniform macroscopic density rather than the two different 
densities that are usually associated with first-order phase transitions. This is often 
interpreted by saying that the condensation occurs in momentum space rather than 
coordinate space, particularly since from a classical point of view, the particles in the 
condensed phase are found in the same region of momentum space, namely, zero 
momentum. 

One observes similar behavior in the other thermodynamic functions. The heat 
capacity has a particularly interesting property. The thermodynamic energy is given by 

3 kTV 

E = ^ 9 M (10-69) 

which becomes 


E 3 kTv 

A r ffs > 2 ^ T>T o 

= 5 05,2(0 t <t 0 


(10-70) 


We differentiate these with respect to T at constant N and V, we get (see Problem 
10-19) 


— - — — o (A) 9 g3/2 ^ T > T 

M~ 4 g m (X) T T ° 

15 v 

= “4" ^3 05/2(1) T < T 0 


(10-71) 


Again, A must be determined numerically from Eq. (10-50), and the result of this is 
shown in Fig. 10-8(a), where C v is plotted against T. There is no discontinuity in 
C v at T 0 , but there is a discontinuity in the slope there. 




T/T c T° K 

Figure 10-8. (a) The heat capacity of an ideal Bose-Einstein gas and (b) the experimental heat capacity 
of liquid helium under its saturated vapor. 
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Bose-Einstein condensation is an interesting phenomenon since it occurs even 
though the particles do not interact with each other through an intermolecular 
potential. Indeed, however, there is an effective interaction through the symmetry 
requirement of the iV-body wave function of the system, and it is this effective inter- 
action that leads to the condensation. Even though the results derived here are valid 
only for an ideal gas, there is a real system to which they are approximately applicable. 
Helium exists in the form of two isotopes: He-3 and He-4. The thermodynamic 
properties of the pure isotopes have been extensively studied, and it turns out that 
He-4, which has a spin of zero and therefore must obey Bose-Einstein statistics, ex- 
hibits many remarkable properties, one of which being the heat capacity curve shown 
in Fig. 10-8(b). The similarity between this experimental curve and the Greek letter A 
has led the transition to be referred to as a 44 lambda transition.” Although the heat 
capacity appears to diverge logarithmically at T = 2.18°K, the similarity between it 
and the heat capacity curve of an ideal Bose-Einstein gas shown in Fig. 10-8(a) is 
striking. One cannot expect complete agreement since liquid He-4 is an inextricable 
combination of quantum statistics and intermolecular interactions, but it appears that 
the experimental heat capacity is due in part to the quantum statistics of the Bose- 
Einstein He-4 system. Furthermore, liquid He-3, which obeys Fermi-Dirac statistics, 
does not have any unusual behavior in its heat capacity curve, just as the heat capacity 
curve of an ideal Fermi-Dirac gas is 44 normal.” Most intriguing, however, is that if we 
calculate the value of T 0 from Eq. (10-59) (using the density of liquid helium = 
0.145 g/cm 3 ), we find that T 0 = 3.14°K, which is the right order of magnitude. One 
important difference between the A-transition in He-4 and the Bose-Einstein condensa- 
tion is that the A-transition is not a first-order transition. Nevertheless, Bose-Einstein 
statistics seem to play an important role in the A-transition in liquid He-4. 


10-5 AN IDEAL GAS OF PHOTONS (BLACKBODY RADIATION) 

In this section we shall apply statistical thermodynamics to electromagnetic radia- 
tion enclosed in a fixed volume V and at a fixed temperature T. The experimental 
system is obtained by making a cavity in any material, evacuating the cavity, and then 
heating the material to the temperature T. The atoms of the walls of the cavity con- 
stantly emit and absorb radiation, and so at equilibrium we will have a cavity filled 
with electromagnetic radiation. Such a cavity is called a blackbody cavity, and the 
radiation within the cavity is called blackbody radiation. 

The quantum mechanical theory of electromagnetic radiation tells us that an 
electromagnetic wave may be regarded as a massless particle of spin angular momen- 
tum ft = h In and with a momentum and energy that are functions of the wavelength. 
These massless particles are called photons. Since photons have a spin of 1 (in units of 
ft), they form an ideal Bose-Einstein gas. A new feature here is that since the walls 
of the blackbody cavity are constantly emitting and absorbing photons, the number 
of photons is not fixed at any instant, and thus N is not an independent thermo- 
dynamic variable. Thermodynamically, the blackbody cavity is described by V and T. 

Let us briefly review some of the results of vibratory motion. In what follows, we 
shall consider only the 44 electro ” part of an electromagnetic wave, since this is several 
orders of magnitude more important for our purposes. Consider a harmonic electro- 
magnetic wave of unit amplitude traveling with velocity c in the positive x-direction. 
Mathematically, this is described by 


E(x 9 t) = sin 



(10-72) 
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and is called a traveling wave. The symbol k is the wavelength of the wave. Physicists 
customarily write this equation in the form 

E(x 9 t) = sin(fc>c — cot) (1 0-73) 

where k = 2n;k and is called the wave vector, and co = 2nv. In some sense, this wave 
also describes a photon with energy and momentum given by 

e = hv = hco = tick 

h 

momentum = - = hk (1 0-74) 

A 

We shall consider blackbody radiation at equilibrium to be a system of standing 
waves set up in the cavity. It is not necessary to do this, since as the volume becomes 
large, the boundary conditions chosen have no effect on the thermodynamic proper- 
ties, but imposing a boundary condition of standing waves is as convenient as any. 
Consider the superposition of two traveling harmonic waves of unit amplitude and 
traveling in opposite directions. This is given by 

4>{x 9 0 = sin(fc>c — cot) + sin (kx + cot) 

= 2 sin kx cos cot (10-75) 

This new wave does not move either backward or forward. It vanishes at the values 
of x for which sin kx = 0 for all values of t. These points are called nodes. They 
occur at the values of x given by kx = nn 9 where n = 1, 2, .... In between the nodes, 
the disturbance vibrates harmonically with time. Figure 10-9 shows the first few stand- 
ing waves that can be set up between 0 and L. The appropriate boundary condition is 
that the standing wave be fixed at the end points 0 and L, which implies that the wave 
vector k be given by 

k = — «=1,2, ... (10-76) 

Li 

Note that this is equivalent to saying that there must be an integral number of half 
wavelengths between 0 and L, that is, n(k\T) — L. 

In order to discuss three-dimensional waves, it is convenient to use an exponential 
representation rather than a sine or a cosine. In one dimension, a wave of unit ampli- 
tude can be described by 

E(x 9 t) = (10-77) 

Equation (10-73) is obtained by taking the imaginary part of this. This is also a 
harmonic wave traveling in the positive x-direction. A standing wave in this represen- 
tation is given by the imaginary part of 

E(x 9 t) = e Kkx ~ mt} + e iikx+at) 

= 2e ikx cos cot (1 0-78) 



ii = 3 



Figure 10-9. The first three standing waves set up between 0 and L. The positions of the nodes do not 
move with time, but the wave vibrates harmonically up and down between the nodes. 
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The boundary condition given in Eq. (10-76) is obtained from this by requiring that 
the imaginary part of Eq. (10-78) vanish at the endpoints 0 and L. Remember that 
the imaginary part of e ie vanishes whenever 6 is an integral multiple of n. 

In three dimensions, an electromagnetic wave is described by 

E(r, t) = <re l(kr ~ cof) (10-79) 

This represents a traveling wave propagated in the direction of the wave vector k. 
The wavelength X is given by |k| = 2 njX. The wave propagates in the direction of k, 
but the direction of E itself is in the direction of <r. The vector <r is called the polariza- 
tion vector and is perpendicular to k. Thus E represents a transverse wave, that is, one 
in which the disturbance vibrates in a direction perpendicular to the direction of 
propagation. In fact, E vibrates in the plane perpendicular to k, and hence there are 
two and only two independent polarization vectors. 

A standing wave can be formed from Eq. (10-79) (where it is understood that we 
take the imaginary part): 

E(r, t) = 2<re ik ’ r cos cot (1 0-80) 

If we let the components of the wave vector k be k x , k y , and k z , the imaginary part of 
this vanishes when k x L = n x n y k y L = n y n, and k z L — n z n, or in vector notation, 
when 

k = — n n x , n y9 n z = 1, 2, ... (10-81) 


We have assumed for simplicity here that the volume is a cube of length L. The energy 
and momentum are given by 

e = fic|k| = tick 

momentum = ft|k| = hk (1 0-82) 

Note that the energy depends only upon |k|. We are going to need an expression 
for the number of standing waves with energy between £ and £ + dz. This is found by 
the same method that we used to find the number of translational energy states between 
£ and £ -h dz in Section 1-3. The square of k is 

n 2 

k 2 = J 2 0 n x + V + Hz 2 ) n x> n y > n z = 1, 2, . . . (10-83) 


Thus the number of standing waves with the magnitude of the vector less than k is 




L 3 k 3 _ Vk 3 
6n 2 6 7t 2 


(10-84) 


and the number between k and k and k + dk is 
/fW( Vk 2 dk 

w(k) dk = — dk= 2 (1 0-85) 

dk 2n 


According to Eq. (10-82), the energy £ = tick. Furthermore, there are two polariza- 
tions with the energy £, and so we can write 


co(fi) de = 


Ve 2 de 


( 10 - 86 ) 


for the number of standing waves with energy between £ and £ + de. 
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The total energy of the system is given by 

£({«*}) = £«*"* (10-87) 

k 


where the number of “ particles ” with energy e k (actually tick) is n k = 0, 1, 2, . . . since 
photons are bosons. The partition function is 


Q(V, T)= X e' fiE({nk}) = V e~ fiTk 
{"*} {**} 




( 10 - 88 ) 


This summation is similar to the one in Eq. (4-18), but in this case there is no restric- 
tion on the set {«*} since the number of photons is not conserved. This is why Q in 
Eq. (10-88) is a function of only V and T rather than the usual N , V 9 and T The 
quantity N is not a thermodynamic variable in this case, since it is not fixed. 
Without the restriction, a summation like the one in Eq. (10-88) is easy since 

e(F,T)=n(l^") 

k \n = 0 / 

All thermodynamic functions are expressed in terms of In Q , which is 
In Q = -I ln(l - «“**) = -£ ln(l - e~ Pt ) (10-90) 

k t 

As usual, this summation can be converted to an integral by introducing the density 
of states and treating e k to be a continuous variable. Thus 


In 6 = 


V 

n 2 c 3 h 3 



— e fie ) de 


(10-91) 


This integral may be readily evaluated by expanding the logarithm and integrating 
term by term: 


V 00 1 r 00 

toe 


2 00 1 


2V 


7 i 2 c 3 h 3 P 3 n = i n 4 n 2 {chp)' 


f(4) 


(10-92) 


where C(4) is the summation, which is a Riemann zeta function [cf. Eq. (10-52)] and 
is equal to 7i 4 /90. (See Problem 1-63.) 

The thermodynamic energy is given by 


E = kT 2 



v 


n 2 V{kTf 
15 (he) 3 


(10-93) 


This result can be used to derive the Stefan-Boltzmann law. In Problem 7-24 it is 
shown that the number of gas molecules striking a surface per unit area per unilftime 
is pv 4, where p is the number density, and v is the average velocity. By analogy, then, 
cE(T)! 4 V is the energy incident per unit area per unit time on the wall of the enclosure 
containing the radiation. Thus if one cuts a small hole of unit area in the wall, the 
energy radiated per unit time 

7i 2 (kT)* 


R = — = 
4V 


m 3 c 2 


= oT 


(10-94) 
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This is known as the Stefan-Boltzmann law, and a is known as the Stefan-Boltzmann 
constant. 

The pressure of the blackbody radiation is 


p — kT 



T 


2{kT) A n\kTf 
n\hc } 4 5(ftc) 3 


(10-95) 


The pressure due to the radiation is negligible except for the highest temperatures. 

It is instructive to go on and calculate the entropy and the chemical potential. The 
entropy is given by [cf. Eq. (2-33)] 


s -*m Q+kT (^r) v 

4n 2 Vk(kT) 3 
4 5(hc) 3 


(10-96) 


If we calculate G = Np from the equation Np = E — TS 4- pV, we find that 


Np = 0 

and since N is not zero, this implies that the chemical potential equals zero for an ideal 
gas of photons. We can prove on thermodynamic grounds that this is true for any 
system in which the number of particles is not conserved. In such a system we can 
write the chemical reaction mA ^ nA, where m and n are arbitrary integers. If we 
apply the criterion that the change in chemical potential A/z = 0 for a system in equi- 
librium, we find that A/z = mp — np = (m — ri)p = 0. But since m — n does not equal 
zero, this implies that p = 0. Since k = exp(/fyz), this also says that we could have 
derived all of the above results by setting k = 1 in the Bose-Einstein formulas (see 
Problem 10-33). 

Before leaving this section, we wish to calculate the energy density at each frequency. 
We can do this by noting that Eq. (10-90) is equivalent to 

lne= -I ln(l -«-»“) 


where co = e/h = ck. If we use this directly in Eq. (10-93), we find that 


* = I 

to 


hcoe pti<0 


1 _ 


(10-97) 


This can be converted to an integral over co by introducing the number of states be- 
tween co and co + dco from Eq. (10-86) and co = e ft. (It is unfortunate that the same 
notation is utilized for the density of states and the frequency, but since the density 
of states always occurs with an argument, such as co(e) or co(A:), there should be no 
confusion. We hesitate writing co(co), however.) If we introduce the density of fre- 
quencies, Eq. (10-97) becomes 

Vh r 00 co 3 

(,0 - 98, 

The thermodynamic energy per unit volume can be written in terms of the energy 
density of each frequency by 

E r“ 

- = p(co, T) da, 

V J o 
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where 


p(co, T)dco = 


h co 3 
^ - 1 


dco 


(10-99) 


This is the famous blackbody distribution law first derived by Planck in 1901. Clearly 
he did not derive this formula in the way that we have since his work preceded, and, in 
fact, led to the development of quantum mechanics. Problems 10-25 through 10-27 
are involved with some well-known results of blackbody radiation. 


10-6 THE DENSITY MATRIX 


In the previous sections of this chapter, we have treated ideal Fermi-Dirac and Bose- 
Einstein gases by applying the results of Chapter 4. The basic equations there are 
derived on the basis of independent particles and so are not applicable when inter- 
molecular forces are present. In this section we introduce a formalism of quantum 
statistical mechanics, which can be generalized to study nonideal quantum systems. 

Let us return now to the completely general equations 


j 

M = ^-YMje~ fiEj 

Q7 J 


(10-100) 

( 10 - 101 ) 


where M is the ensemble average of the mechanical property, and Mj is the quantum 
mechanical expectation value of the operator Ai in the jth quantum state. 

We wish to express Q and M in terms of the quantum mechanical operators 
and M. Let {ipj} be the set of normalized eigenfunctions of and {£)} be the corre- 
sponding eigenvalues, that is. 


Jtfipj = Ejipj ( 10 - 102 ) 

Since iJ/j is an eigenfunction of we can also write 

tf n \pj = Efij/j (10-103) 

for integral n. This allows us to define the result of an analytic function of acting 
on ij/j . In particular, we have 




y izK 

»=o n\ 


ENj = e-Vtyj 


(10-104) 


(In general, functions of operators are defined through their MacLaurin expansions.) 
Thus we have 

e~ p *il/j = e~ fiEj il/j (1 0-1 05) 

We multiply both sides of this equation by ij/j* and integrate over all the coordinates 
involved to get 


e PEj = J pJt *l/j dx 


(10-106) 
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where we have used the fact that exp(— is just a number and that ij/j is normalized. 
The symbol dx represents an integration over all the coordinates on which \pj depends. 
Using Eq. (10-106) then, Q becomes 

Q = £ e~ fE J = £ f dx (1 0-1 07) 

j J J 

Integrals of quantum mechanical operators, such as exp(— /taf) above, between wave 
functions occur naturally in the matrix formulation of quantum mechanics and are 
called matrix elements. They can be represented by the notation 

J dx = (10-108) 

In this notation, the canonical partition function is 

e = I jj = Tr(e~ fi *) (10-109) 

j 

where we have employed the notation of representing a summation over the diagonal 
elements of a matrix by Tr, which stands for “ trace.” 

Now there is a standard theorem of matrix algebra which says that the trace of a 
matrix is independent of the particular function ^ in Eq. (10-108) used to calculate 
the matrix elements. This theorem is easy to prove. Let {ipj} be the orthonormal set of 
eigenfunctions of and let {$„} be any orthonormal set of functions that can be 
expanded in terms of the ^ as 

= ( 10 - 110 ) 
n 

where the a jn ’s are constant. We can calculate the a Jn 9 s by multiplying both sides of 
eq. (10-110) by integrating over all values of the coordinates, and using the fact 
that the ^/s are orthonormal: 

Oj„ = j 'P*4>j ^ (10-111) 

Furthermore, since the <f>fs are normalized, we have that the a jn must satisfy the con- 
ditions (see Problem 10-34) 

l 00-112) 

n 

We shall also need the coefficients for the expansion of ij/j in terms of the 0’s. Write 

= (10-113) 

I 

The b st can be related to the a jn by multiplying both sides of this equation by <p t * and 
using the fact that the 0 f ’s are orthonormal: 

K, = J 4>,*4> s dx = a,* (10-114) 

where we have used Eq. (10-1 1 1) to write the last equality. Thus b t j s are obtained from 
the a t /s by reversing the subscripts and taking the complex conjugate. Since the ip's 
are normalized, we have 

Z bj* b J» = I a »* a »j = 1 

n n 


( 10 - 115 ) 
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Note that this equation is similar to Eq. (10-1 12). Here we sum over the first subscript, 
and there we sum over the second. We shall need this relation to prove the theorem 
referred to above, namely, that 


Q = Z / 'i / j* e dx = X !^* e dr 0 o-i 1 6) 

To prove this, we substitute Eq. (10-110) into Eq. (10-116) to get 

*e-^4>j dx=Y a Jm* a jn e~ PEn J dx ( 10 - 11 7 ) 

m, n J 

But the integral is just a Kroenecker delta S mn since the set {ipj} is orthonormal. Thus 
the double summation over m and n becomes a single summation, and we have 



J 4 >j*e p * 4 >jdx = '£ a j»* a jne-‘‘ E " (10-118) 

We now sum both sides over j and use Eq. (10-115) 

E f 4>j* e ~ P3e 4> j d * = 'Z e~ pEn = £ e~ pEj = Q (10-119) 

j J n j 

which shows that the trace of exp (— ) is independent of the particular orthonormal 
set of functions used to compute the matrix elements. 

Equation (10-109) gives Q in terms of the quantum mechanical operator We 
now wish to express Eq. (10-101) for M in terms of quantum mechanical operators. 
The ensemble average of M is 


M = 


1, 

5 >-'" 


We first use the fact that is given by 


Mj = J il/j*Mil/j dz 


( 10 - 120 ) 


( 10 - 121 ) 


where A? denotes the quantum mechanical operator corresponding to M, and ipj is 
the eigenfunction of in the jth state. We substitute this into the numerator of 
Eq. (10-120) and perform a series of elementary manipulations: 


£ Mj e fiEj = £ e fiEj f il/j*tihl/j dz 
J j J 


£ J ij/j*Nie fiEj tl/j dz = £ J ij/j*Nie dz 
l (fie-**)jj = Tr 


( 10 - 122 ) 


The invariance of the trace with respect to the functions used to calculate the matrix 
elements says that any convenient orthonormal set of functions could have been used 
to derive this result. 

Using this result for the numerator in Eq. (10-120) gives 


„ Tr(Me _<wr ) 
M ~ Tr (.-*5 


(10-1 23) 
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The denominator of this expression is a scalar, and hence we can define a new operator 
P by 


P = 


e~** 

Tr(e-^) 


(10-124) 


and write for Eq. (10-123) 
M = Tr(Mp) 


(1 0-1 25) 


The operator p is the quantum mechanical analog of the equilibrium density of points 
in the phase space for an canonical ensemble. The matrix corresponding to p is called 
the density matrix. Equation (10-123) corresponds to the classical expression 


j dp dqM(p,q)e 

M ~ J J dp dqe-W'* 


(10-126) 


One can prove that the trace operation in Eq. (10-123) goes into the phase space inte- 
gration in Eq. (10-126) as h -►0. 

We have derived the above equations and defined the density matrix p only for a 
canonical ensemble. The density matrix can be defined in a much more general manner, 
but since we shall need only the results for a canonical ensemble in this book we simply 
refer to Tolman (see “Additional Reading”), who gives a complete discussion of the 
density matrix. Tolman shows that p can be defined for nonequilibrium systems, giving 
the quantum mechanical correspondence to the density of phase points in phase space. 
There is, for example, a quantum mechanical analog of the Liouville equation. 


10-7 THE CLASSICAL LIMIT FROM THE 

QUANTUM MECHANICAL EXPRESSION FOR Q 

In this section we shall derive the classical mechanical form of the canonical parti- 
tion function (an integration over phase space) directly from its quantum mechanical 
form (a summation over energy states). In other words, we shall show that 

Q = Ze~ fiEj n^3nJ j e ~ PH dp 1 --dp N dq l -dq N 

In Chapter 7 we presented an argument that was meant to make this correspondence 
plausible, but here we shall present a rigorous treatment due originally to Kirkwood.* 
In particular, we shall show how to derive an expansion for Q in powers of ft, with the 
leading term being the classical limit. Such expansions were derived in Sections 10-1 
and 10-3, but there the results were valid only for ideal systems. Here we shall relax 
this restriction. The quantum mechanical level of this section is somewhat higher than 
most of the others in this book, and this section can be omitted on first reading, since 
the material presented here is not necessary in the development of later sections. For 
those who do choose to read this section, however. Appendix B presents a brief 
discussion of Fourier transforms, delta functions, and so on. 

We start with Eq. (10-1 16) for a system of N monatomic particles, namely, 

e = lj 4>j*e- p *’4>j dr (1 0-1 27) 

* J. G. Kirkwood, Phys. Rev., 44 p. 31, 1933; 45, p. 116, 1934. See also Hill in “Additional Reading.” 
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where we have written dr = dx t dy l • - * dz N for dr. The functions 4>j(r u . . . , r N ) in this 
case are taken to be the eigenfunctions of the Hamiltonian operator and form a 
complete orthonormal set. The Hamiltonian operator is 

je = jr + u=-!f-l l V , 2 + t/( r„...,r„) 

2m 1=1 

We wish to transform Eq. (10-127) into an integration over phase space. To accom- 
plish this, we introduce the eigenfunctions «(p 1 , r N ) of the momentum op- 

erator — ih\7: 

«(Pi. • - ■ > r/v) = exp^ £ p* • r fc J (1 0-1 28) 

Note that — iftV / w = p i w. We now expand the <j>j in Eq. (10-127) in terms of the 
momentum eigenfunctions 

H r i r w) = /••• J ^/Pi» - • • , P w )exp^ • r*j dPi ' " dp N (10-1 29) 

The 4>j in Eq. (10-127), being eigenfunctions of Jf, are either symmetric or anti- 
symmetric in the coordinates of the N particles. Thus they should be expanded in terms 
of linear combinations of the u( p l5 . . . , r N ) in Eq. (10-128) that themselves are sym- 
metric or antisymmetric in the coordinates of the N particles. We could do this by 
introducing a permutation operator but the resulting equations become fairly 
involved. For simplicity, we shall ignore this complication and simply use Eq. (10-129). 
The result of this is that we shall not derive the factor of Nl that occurs in the classical 
partition function. Thus the central result of this section, Eq. (10-133) with 
Eqs. (10-135), (10-138), through (10-140), is lacking an Nl in its denominator. This 
factor is included, however, in Kirkwood’s original paper and in the more pedagogical 
discussion of Hill. 

Equation (10-129) shows . . ., r N ) as a Fourier transform of Aj, and so we 

can use the inversion theorem of Fourier transforms to write 

A/Pi, • • • , P N ) = (2^51? J • ‘ ‘ f <P/ T i’ •••> r w)exp[- Y,Pk • r k ] dr , • • • dr N 

(10-1 30) 


We now substitute Eq. (10-129) into Eq. (10-127) to get 

Q = X J ' ' * * / $*(? i> •••* t n) A j{ P», ... P N )e~ pjr exp^ £p k • r k j dp,- • dr N 

and then use Eq. (10-130) for Aj (with r/, . . . , r N ' replacing r x , . . . , r N as the variables* 
of integration) to give 


Q 



x 


, . . . , r w )0/r,', . . . , r w ')jexp j- £ p* • r k 'j 
exp(-p^)exp^ £ p k • r k J dp, • • • dr, • • • dr,' • • • dr/ 
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The summation over j in the braces is £(ij — r t ', . . . , r w — r N ') (cf. Problem B-7), and 
so the integration over the primed variables simply gives 

Q = py / ' * * J ex P [“ ^ E P* ’ r *] exp(-^)exp [j: E P* • **j <*Pi ‘ ‘ ’ dr " 

(10-1 31) 

Note that J? contains the operators V/ 2 and so does not commute with 

Thus the order of the terms in the integrand of Eq. (10-131) is important. 

Equation (10-131) is in the desired form of an integral over phase space, but the 
integrand contains the quantum mechanical operator exp( — pjjf) instead of the 
classical function exp (—pH). Kirkwood defines a function wfo, . r N , /?) by the 
relation 


exp(-/?^)exp l - £ p* • r*j = exp(-/?if)exp^ I P* * r k w<p„ . . . , r w . 


P) 


= F( Pl ,...,r w ,/S) (10-132) 

Note that this function w(p l5 r N , P) has been defined such that Eq. (10-131) 
becomes 


C = J-" J e xp(-^H)w(p„..., r w ,/?)^i ••• (10-133) 

Remember that this expression for Q is missing a factor of Nl in the denominator 
since we have chosen to ignore the symmetry properties of the <f>j in Eq. (10-127). This 
equation for Q shows that the quantum corrections to the classical partition function 
lie in the function w(p l5 . . . , r N , P), and so we must now investigate this function. In 
particular, we wish to show that w(p x , . . . , r N , P) 1 as h 0. 

It is possible to evaluate w by carrying out the operation on the left-hand side of 
Eq. (10-132) and comparing the result to the right-hand side, but this turns out to be 
an extremely tedious route. A more convenient way is to differentiate Eq. (10-132) 
with respect to P to get 

= b (‘ “ ** + + " ’Mi' ? ' ' '*] 

= (10-134) 

This differential equation is called a Bloch differential equation and in this case has 
the boundary condition 

F(P = 0) = expjj- E P* ' r * 

It is not possible to solve this equation in general (note its similarity with the 
Schrodinger equation), but it is fairly straightforward to determine the first few 
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coefficients in an expansion of F in powers of h. We do this by actually expanding not 
F but w according to 

00 

MPu . . . , r N ,P)= Y feWp„ ...,r N ,P) (10-135) 

/= 0 

This equation defines the functions w t . We now substitute this expansion into the 
right-hand side of Eq. (10-132) and that result into Eq. (10-134), and after some 
amount of work and cancellation of exp(— pH) and 



we get 

+ ~~ + 0(fc 2 )J - H{w 0 + ftw, + 0 (ft 2 )} = - U{w 0 + h Wl + 0(h 2 )} 

- K{w 0 + hw t + 0 (ft 2 )} + — ( Y Pi • V,- w 0 - Pw 0 Y Pi • V, L/| + 0(h 2 ) 

m h=i i = i ) 

(10-136) 

where K is the sum of the kinetic energies of all the particles. To arrive at this result, 
we have used the following relations: 

Ve'tn = e~ fiK Ve~^ u = -pe~ fiH VU 

V 2 e~ pH = -pe~ fiH V 2 U + P 2 e~ fiH (VU) 2 

V; exp[i ^ P. • - j Pj “p[j 

V «p[j I R ' ^ Kj exp[i J p, ■ r.] 

where Kj is the kinetic energy of the yth particle. 

The coefficient of h to the zero power in Eq. (10-136) gives dw 0 /dp = 0 or yv 0 = 
constant. The value of the “constant” can be found from the boundary condition 

F(p = 0) = expjj- Y Pic • r*j 

From the defining equation for F(P), that is, Eq. (10-132), we see that this is equivalent 
to the condition 

w(Pi r N ,P = 0) = 1 

and so Eq. (10-135) reads 

1 = w'oCPi. • • • ,t n , p = 0) + hw t ( p u . . . ,rjv, P = 0) + ft 2 w 2 (p„ . . . ,r N , p = 0) + • • * 
Since the w t are independent of h , this implies that 
w o(Pi> • • • > r N » P = 0) = 1 

^i(Pi , . . . , r* , p = 0) = 0 / > 2 (10-137) 

But we have seen that w 0 is a constant independent of /?, and so we have generally that 
w 0 = 1 (10-138) 
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and a formal proof that the quantum mechanical partition function goes over into 
the classical limit according to Eq. (7-9). Note that the classical limit is obtained in 
two ways. The more obvious one is to let h 0, but it is also obtained from the limit 
P = 0 [cf. Eqs. (10-137)]. 

We can calculate the first quantum correction to w(p l5 . . . , r N , ft) by comparing the 
coefficients of h to the first power in Eq. (10-136), namely. 


dw t 


W " 


which upon integration gives 


N 


ip 2 v „ 


VjU 


(10-1 39) 


Since this term is odd in the momenta, its contribution to Q according to Eq. (10-133) 
will vanish. (See Problem 10-35.) The contribution from w 2 does not vanish, however, 
and represents the first correction to Q . The evaluation of w 2 follows along the same 
lines as the evaluation for w 0 and w u but is quite a bit more lengthy. The result is 

"» ■ - - Si {? ' ™ - 7 [ (W)I + ■' H + & » ■ ■ ™>’} 11 401 


where we have used the abbreviated notation 


N 

p • a = Z p j'*j 

J = 1 

In Chapter 15, Eq. (10-140) is used to calculate the first quantum correction to the 
thermodynamic properties of imperfect gases. 
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PROBLEMS 

10-1. Referring to the discussion following Eq. (10-5), derive expressions for the thermo- 
dynamic functions A, G, \i 9 and S from E and p as functions of TV, V 9 and T. 

10-2. Derive a virial expansion for E, A, and S for a Fermi-Dirac ideal gas. 

10-3. Derive a virial expansion for E, A, and S for a Bose-Einstein ideal gas. 

10-4. Derive Eqs. (10-11) and (10-12) from Eqs. (10-9) and (10-10). 

10-5. Consider the power series 


Z = * + 27 + 3!' + 


Invert this to find x as a power series in z . Compare your result to the expansion of ln(l + z), 
since z above is actually e x — 1 . 

10-6. Give numerical estimates of the Fermi energy, fx 0 , and of 7> = fM 0 /k for (a) electrons 
in a typical metal such as Ag or Cu, (b) nucleons, for example, neutrons and protons, in a 
heavy nucleus, (c) He 3 atoms in He 3 gas, in which the volume available to each atom is 50A 3 . 
Treat the particles as free fermions. 

10-7. The density of sodium metal at room temperature is 0.95 g/cm 3 . Assuming that there 
is one conduction electron per sodium atom, calculate the Fermi energy and Fermi tempera- 
ture of sodium. 

10-8. Estimate the pressure of an ideal Fermi-Dirac gas of electrons at O K. 

10-9. Show that the derivative of /(e) in Eq. (10-20) is symmetric about p and that 

f* f'(e)de =- 1 

J — 00 


where f\e) = dfide. 

10-10. Prove that the expression e x /(l + e x ) 2 is an even function of x. 
10-11. Prove that 


i. 


x s e x dx 

(1 + e x Y 


j odd 


CO 

= -200 2 

n= 1 


(- 1 )" 

n J 


— 2 (/ 0 1 ? 0 ) 


/eve n 
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The summation *?(j) is closely related to the Riemann zeta function (c/. Abromowitz and 
Stegun) and has the values *?( 2) = 7 t 2 /12, *?(4) = 77^/720, .... 

10-12. Carry Eq. (10-36) one term further and show that 


( t r 2 (kT \ 2 t r 4 (kT\* \ 

10-13. Show that for an ideal Fermi-Dirac gas that 
2 yVju 0 I 5t r 2 /m 2 n* (kT\ A \ 

p= s~r ( l+ irw + ‘ j 


10-14. Show that at 3000' K, /x for aluminum differs from /x 0 by less than 0.1 percent 
(^io = 1 1 .7 eV). 

10-15. Show that for an ideal Fermi-Dirac gas, the Helmholtz free energy is 


I Sir 2 (kT\ 2 it* / kT\ * \ 

+ •! 


and that the entropy is 


10-16. Take Eq. (10-37) one term further and show that 
f 5 t t 2 /kT\ 2 7 t* (kTY \ 


10-17. Show that 
CK = yA 2 7/W 

is the constant volume heat capacity of an ideal Fermi-Dirac gas if fx 0 > kT \ where /(e) is 
the density of states. 

10-18. Consider a system in which the density of states of the electrons f(e) is 

/(e) = constant = D e > 0 
= 0 e <0 

Calculate the Fa-mi energy for this system; determine the condition for the system being 
highly degenerate; and then show that the heat capacity is proportional to T for the highly 
degenerate case. 

10-19. Derive Eq. (10-71) for the constant volume heat capacity of an ideal Bose-Einstein 
gas. 

10-20. Prove that the heat of transition associated with Bose-Einstein condensation is 
given by Eq. (10-68). 

10-21. Show that 0„(A) defined by Eq. (10-47) obeys the following recursion formula: 


ffn-l — 


SflnA) 

Also show that for A close to unity, that 

0SI2 (X) = 2.363(— In A) 3/2 + 1.342 + 2.612 In A - 0.730 (In A) 2 + • • • 

From these two results, show that the discontinuity of ( SCy/ST ) aX T=T C for an ideal Bose- 
Einstein gas is 

/0Cy\ /dCy\ 


3.66 Nk 
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10-22. Consider an ideal Bose-Einstein gas in which the particles have internal degrees of 
freedom. Assume for simplicity that only the first excited state need be considered and that 
this has an energy e relative to the ground state, taken to be zero. Show that the Bose-Einstein 
condensation temperature of this system is given by 

T c = r c °{l - 0.255c -‘ ,kTc ° 4- •••} 

assuming that e~ mTc ° 1. How are the thermodynamic functions affected by e. 

10-23. Does a two-dimensional Bose-Einstein ideal gas display a condensation as it does 
in three dimensions? 

10-24. Show that in two dimensions the heat capacity CV of an ideal Fermi-Dirac and 
Bose-Einstein is the same. 

10-25. Derive the Rayleigh-Jeans law 

877 V 2 

p(v, T)dv = — r— kT dv 
c 


from the Planck radiation law by considering the limit hv kT \ Derive the Wien empirical 
distribution law 

877 hv * 

p(v, T)dv — — - — e hvlkT dv 
c * 


by considering the high-frequency limit. 

10-26. Show that the Planck blackbody distribution can be written in terms of wave- 
lengths A rather than frequency: 


p(\T)dX 


877 he dX 

\s e h cJXkT _ | 


where p{ A, T) dX is the amount of energy between wavelength A and A + dX. 

10-27. If o> max is the frequency at which p(to, T) is a maximum, illustrate by maximizing 
In p( o>, T) that a)^ is given by 


kT 


— 3(1 — e~ ltw ™** /kT ) 


and so 



Similarly show that 

A m . x r = 0.290 cm-deg 

Calculate the temperature for which A mtx is in the red region of the spectrum. 

10-28. Derive Eq. (10-93) by evaluating the integral in Eq. (10-98). 

10-29. In Problem 7-24, it was shown that the number of molecules striking a surface 
per unit area per unit time is pvj 4. By a similar approach, show that the total energy flux 
radiated by a blackbody is 

c E g 

where o=27T s k*/15h 3 c 3 . This result is known as the Stefan-Boltzmann law, and o is the 
Stefan-Boltzmann constant. Verify that a, a universal constant, equals 5.669 x 10“ 5 erg/cm 2 - 
deg 4 -sec. 

10-30. Show that 
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for a photon gas and compare this to the analogous result for bosons and fermions with 
nonzero rest mass. 

10-31. It has been stated that in the early stages, a nuclear fission explosion generates a 
temperature of the order of a million degrees Kelvin over a sphere 10 cm in diameter. Assum- 
ing this to be true, estimate the total rate of radiation emitted from the surface of this sphere, 
the radiation flux a few miles away, and the wavelength corresponding to the maximum in the 
radiated power spectrum. 

10-32. Derive an expression for Planck’s blackbody distribution law and Stefan’s radiation 
law for a two-dimensional world. 

10-33. Derive all of the principal results for a photon gas by setting A — 1 in the Bose- 
Einstein formulas. 

10-34. Prove Eq. (10-112). 

10-35. Show that the contribution of the term w x [Eq. (10-139)] to the partition function Q 
vanishes, since it is odd in the momenta. 

10-36. Derive Eqs. (10-62). 



CHAPTER 11 


CRYSTALS 


In this chapter we shall discuss the application of statistical thermodynamics to 
the calculation of the thermodynamic properties of crystals. Unlike dilute gases, the 
interatomic interactions in a crystal are not negligible, but we shall see that the 
concept of normal coordinates allows us to treat a crystal as a system of independent 
“ particles.” In Section 1 1-1 we show that all of the thermodynamic properties of a 
crystal can be expressed in terms of the distribution of its normal vibrational frequencies. 
This distribution function is difficult to calculate exactly, but in Sections 11-2 and 
11-3 we discuss two well-known simple approximations, the first due to Einstein 
(Section 11-2) and the other due to Debye (Section 11-3). In Section 11-4 we turn to 
the problem of an exact vibrational analysis of a crystalline solid. We shall determine 
the exact vibrational spectrum of two types of one-dimensional lattices. Although the 
results for one-dimensional lattices are not directly applicable to real crystals, the 
basic ideas and techniques associated with such a calculation serve as an introduction 
to the field of lattice dynamics, that is, the calculation of the vibrational spectrum of 
more realistic lattices. The final two sections of the chapter contain a discussion of 
two important topics in the statistical thermodynamics of crystals. In Section 1 1-5 we 
introduce the concept of a phonon, and in Section 11-6 we discuss several of the most 
important types of defects or imperfections that occur in real crystals. 

11-1 THE VIBRATIONAL SPECTRUM OF A MONATOMIC CRYSTAL 

In this section we shall derive the partition function for a monatomic crystal. 
Although we can hardly ignore the interatomic (or intermolecular) interactions in a 
solid, we shall see that it is, nevertheless, possible to treat a crystalline solid as a system 
of independent “ particles.” The crucial point here is the existence of normal coordi- 
nates. For many purposes, a crystal may be represented by a system of regularly spaced 
masses and springs, as illustrated two dimensionally in Fig. 1 1-1. The springs represent 
the resultant interatomic force that each atom “sees” about its lattice point Each 
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Figure 11-1. A two-dimensional version of a mass and spring model of a crystalline lattice. 

atom sits in a potential well whose minimum is at a lattice point, and as the atom 
moves from its equilibrium position, the net force restores it to its equilibrium position. 
Thus the effect of the intermolecular forces between all the atoms in a crystalline solid 
may be represented by a system of springs as illustrated in Fig. 1 1-1. Clearly the force 
constants that we assign to these springs reflect the interatomic forces involved. 

In a typical crystal, the potential well that each atom “sees” is very steep, and so 
each atom vibrates about its equilibrium lattice point with a small amplitude. This 
allows us to expand the interatomic potential of the entire crystal in a Taylor series. 
Consider the one-dimensional example shown in Fig. 1 1-2. The total potential energy 
is a function of the displacements of the N atoms from their equilibrium positions, 
that is, U = U(€ lt f 2 , £*). Since the atoms vibrate with small amplitude about 

their equilibrium positions, we write 





/-I /•+! 


nnnr'— o — nrrr 


$/-i % £/+ 1 

Figure 1 1-2. A one-dimensional system of masses and springs. The npper system is in its equilibrium 
configuration, where all the masses are separated by a distance a 9 the unstrained length 
of the spring. The lower curve shows an arbitrary configuration which is described by the 
set of displacements {£,} of the atoms from their equilibrium positions. 
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where the zero subscript on the derivatives indicated that they are to be evaluated at 
the point at which all the £,• equal zero. Since V is a minimum when the ^ = 0, the 
first derivatives in Eq. (1 1-1) are zero, and so we have 


U(Z» Hz , . . . , U = 1/(0, o, .... 0) + i £ Sj + 


= l/(0,0,...,0 ) + Wk u t,tj + 

2 i.j 


( 11 - 2 ) 


where we have introduced the set of force constants k u . The important result here is 
that f 2 , • ••, €n) IS a quadratic function of the displacements. Note that 

1/(0, 0 9 ...» 0) is a function of the lattice spacing, which in turn is a function of the 
ratio V/N , or the density. To emphasize this, we shall write 1/(0, 0, . . . , 0) as 1/(0; p). 
We are taking the zero of energy to be the separated atoms at rest. In addition, the 
curvature of l/(f t , >••*>£*) at the minimum is a function only of V/N, and so the 
force constants are functions of V/N only. 

Equation (11-2) represents a system of coupled harmonic oscillators. We say that 
they are coupled because of the cross terms in Eq. (11-2). If it were not for these 
cross terms, Eq. (11-2) would be a sum of independent squared terms, and the 
Lagrange equations of motion (classically) or the Schrodinger equation (quantum 
mechanically) would yield N separate or uncoupled harmonic oscillators. This is 
similar to the situation that occurred when we treated the vibration of a polyatomic 
molecule in Section 8-1, and, in fact, a crystal containing N atoms can be considered 
to be just a large polyatomic molecule. We saw in Chapter 8 that the vibrational 
motion of a polyatomic molecule can be rigorously decomposed into a set of indepen- 
dent harmonic oscillators by introducing normal coordinates. In principle, we can 
apply such a normal coordinate analysis to an entire crystal. If there are N atoms in a 
monatomic crystal, there are 3 N degrees of freedom, of which three are associated 
with the translational motion of the whole crystal, and three more are concerned with 
the rotation of the crystal. There are then 3N — 6 vibrational degrees of freedom. But 
with N = O(10 2 °), we can take this number of vibrational degrees to be 3N, without 
noticeable error. The result of a normal coordinate analysis would yield 3 N — 6 « 3 N 
vibrational frequencies 

1 /k\ m 

V ' = 2^W j=l,2,...,3N-6*3N (11-3) 


where kj and pj are an effective force constant and an effective reduced mass, respec- 
tively. The precise form of kj and pj are not important for our purposes. The important 
point is that the complicated general vibrational problem can be mathematically 
reduced to 3 N (really 3 N — 6) independent harmonic oscillators, each with its own 
frequency, which is a complicated function of the masses, force constants, and geom- 
etry of the lattice. Since the kj in Eq. (11-3) depends upon the k tj in Eq. (11-2), the 
kj and the frequencies Vj in Eq. (1 1-3) depend upon V/N rather than V or N separately. 

Since the crystal does not translate or rotate,* the complete partition function is 
given by 


Q 



3N-6 


_ £- 1 /( 0 ;p)fkT 


0 *?vib, 
j= 1 


(11-4) 


* More precisely, the translational and rotational degrees of freedom of the crystal contribute negligibly 
to the partition function a per molecule basis. 
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where q vihJ is the vibrational partition function associated with the yth vibrational 
frequency. Note that there is no factor of N! in the denominator of Q. Since each 
molecule is restricted to the neighborhood of its lattice point, and the lattice points 
could, in principle, be labeled, the atoms themselves must be considered to be dis- 
tinguishable. We have already evaluated q vih in our treatment of diatomic gases and 
found that 

e -hv/2kT 

4vib — J _ e ~hv/kT 0*1-5) 


Therefore the total partition function is given by 

3 N / p—hvjIlkT \ 

Q= n\T^^Y Vi ™ kT (11 " 6) 

Since there are 3 N normal frequencies, they are essentially continuously distributed, 
and we can introduce a function g(v) dv, which gives the number of normal frequencies 
between v and v + dv. If we introduce this into the logarithm of Eq. (1 1-6), we have 

-In Q = + J" [ln(l - e~ kv,kT ) + ^g(y) dv (11-7) 

where 


f °° 

g(v)dv = 3N (11-8) 

J o 

since there are 3 N normal frequencies in all. If we can determine the function g(v), 
we can then calculate the thermodynamic properties of the crystal. For example, we 
have 


E = 


U(0; p) + j" 


- hve~ hv/kT 
(1 _ e ~ hv/kT ) 



and 


(11-9) 


r 00 ( hv/kT) 2 e hv/kT g(v) dv 
o (1 - e~ h ^ kT ) 2 


(11-10) 


Equations (11-7) through (11-10) are essentially exact. In order to use them, we 
must know the function g(v\ and this is, of course, where the difficulty lies. The func- 
tion g(v) is easier to determine than the entire set of individual frequencies, but it is, 
nevertheless, a very difficult problem. We shall discuss some exact calculations of 
^(v) in Section 1 1-4, but before that, we shall introduce two useful and well-known 
approximations to #(v). One of these is due to Einstein and says that all the normal 
frequencies are the same; the other is due to Debye, who treated a crystal as a con- 
tinuous elastic medium and calculated #(v) by studying the elastic waves that can be 
set up in such a body. 


11-2 THE EINSTEIN THEORY OF THE SPECIFIC HEAT OF CRYSTALS 

In this section we shall discuss an extremely simple model for the vibrational 
character of a crystal. It is so simple, in fact, that we should go back in time to the 
beginning of this century in order to appreciate its great insight and impact. Classical 
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statistical thermodynamics was fairly well developed by the end of the nineteenth 
century. If it is considered that the N atoms of a crystalline solid behave as harmonic 
oscillators about their equilibrium positions, classical theory (equipartition) predicts 
that each atom would contribute k cal/deg-mole for each of its three vibrational 
degrees of freedom, or that the molar heat capacity at constant volume would be 
3 Nk = 3J? = 6 cal/deg-mole. This prediction, which is known as the law of Dulong 
and Petit, is in good agreement with the observed heat capacity of many crystals at 
high enough temperatures, and often down to room temperature, but the agreement 
fails completely at low temperatures. For example, the heat capacity C v of silver is 
shown in Fig. 11-3. It can be seen that the Dulong and Petit value is approached 
asymptotically, but that the curve falls rapidly to zero as T -► 0. This behavior is 
observed quite generally, and experimentally it is found that the heat capacity goes 
to zero as T 3 as T -> 0. This is known as the T 3 -law and is an experimental observa- 
tion which any successful theory must reproduce. 

At the beginning of this century, deviations from predicted classical behavior were 
being discovered regularly, and each one was a severe challenge to the physical 
theories of the time. Einstein was the first to present a theoretical explanation of the 
low-temperature heat capacity of solids by applying the revolutionary blackbody 
radiation theory ideas of Planck to the vibrations of atoms in crystals. Einstein 
assumed that each atom in the crystal vibrates about its equilibrium configuration as 
a simple harmonic oscillator, so that the entire crystal could be considered to be a set 
of 3 N independent harmonic oscillators, each oscillator having the same frequency v. 
Physically then, he assumed that each atom of the crystal sees the same environment 
as any other, and so all N atoms could be treated as independent oscillators in the 
jc-, y-, and z-directions. Classically such an assumption leads to the Dulong and Petit 
value of 3 R cal/deg-mole, but Einstein’s great contribution (in 1907) was to say that 
the energy of each of these 3 N independent oscillators had to be quantized according 
to the procedure developed by Planck. Thus, with our advantage of using a formalism 
and notation developed long after the turbulent years of the beginning of the century, 
we can say that Einstein assumed that the frequency spectrum g(v) was a delta func- 
tion at one frequency 

g(v) = 3 N3(v - v £ ) (11-11) 


C v (cal/deg-mole) 



Figure 1 1-3. The molar heat capacity at constant volume of metallic silver as a function of temperature. 
(From C. Kittel, Solid State Physics , 2nd ed. New York: Wiley, 1956.) 
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where the factor 3 N is included, so that Eq. (11-8) is satisfied, and v £ is the single 
frequency assigned to all 3 N independent oscillators of the crystal. The value of the 
Einstein frequency v £ varies from substance to substance and, in some way, reflects 
the nature of the interatomic interactions for the particular crystal. In the light of the 
organized development presented in Section 11-1, this may appear to be a terribly 
gross assumption, but it was a major step forward, and “ in a sense the final step,”* 
in the understanding of the heat capacity of solids. 

If we substitute Eq. (11-11) into Eq. (11-10), we find 

/hv E \ 2 e ~ hVE/kT 

c r= Mk \W (1 -«--r 


for the heat capacity. It is customary to define a quantity © £ by Av £ /fc, which has 
units of temperature and is called the Einstein temperature of the crystal. In terms of 
the Einstein temperature, Eq. (11-12) is 


Cy = 3 Nk 



e ~e*/ T 

(1 _ e-ezlT ) 2 


(11-13) 


Equation (11-13) contains one adjustable parameter to fit the entire heat capacity 
curve shown in Fig. 11-3. Figure 11-4 shows a comparison of Eq. (11-13) versus the 
experimental heat capacity of diamond. This figure is taken from Einstein’s original 
paper and shows the success of such a simple theory. It is easy to show from 
Eq. (11-13) that C v approaches the Dulong and Petit value of 3 Nk = 3R as T -► oo. 
(See Problem 1 1-2.) 

Although Fig. 1 1-4 shows that the Einstein model of a crystal is capable of giving 
an impressive qualitative agreement with experiment, it is not in quantitative agree- 
ment. In particular, Eq. (1 1-13) predicts that the low-temperature heat capacity goes as 


Cy 



(11-14) 



0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 


m E 

Figure 11-4. Comparison of the experimental heat capacity of diamond with the prediction based on 
the Einstein theory with S E = 1320°K. (From C. Kittel, Solid State Physics , 3rd ed. 
New York: Wiley, 1967., after A. Einstein, Ann. Physik ., 22, 180, 1907.) 

* From Blackman in 4< Additional Reading.” 
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instead of T 3 as the experimental data. The low-temperature heat capacity predicted 
by Eq. (1 1-14) falls to its zero value more rapidly than T 3 -law. We shall see in the next 
section that the Debye theory, which came a few years after Einstein’s, gives a T 3 -law 
as T -► 0. 

Before going on to the Debye theory, we point out an important feature of 
Eq. (11-13), which is also exhibited in more rigorous theories. Equation (11-13) 
predicts that C v is the same function for all substances if it is plotted versus T/© £ . 
When this happens, we say that C v is a universal function of T/© £ , and that the 
various crystals obey a law of corresponding states . Once the temperature is scaled or 
“ reduced ” by a quantity that depends upon the particular substance, the heat capacity 
versus the reduced temperature will superimpose for all crystals. Although the Einstein 
model does not quantitatively reproduce experimental data, its prediction of a law of 
corresponding states is, in fact, correct. 


11-3 THE DEBYE THEORY OF THE HEAT CAPACITY OF CRYSTALS 

According to the ideas of Planck, the energy of an oscillator is proportional to the 
frequency, and since it is the lower energies that are populated at low temperatures, we 
can reason that it is the low-frequency or long-wavelength modes that are most 
important at low temperatures. The success of the Debye theory is that it treats the 
long-wavelength frequencies of a crystal in an exact manner, and hence is able to 
predict the low-temperature heat capacity. 

The normal frequencies of a crystal vary from essentially zero to some value of the 
order of 10 13 cycles/sec (Hz) or so. Normal frequencies are not due to the vibrations 
of single atoms, but are a concerted harmonic motion of all the atoms. This concerted 
motion is called a normal coordinate or a normal mode. Note, for example, that the 
normal coordinates of C0 2 and H 2 0 involve the synchronous motion of all the atoms 
in each molecule. Two extremes of normal modes of a one-dimensional crystal are 
shown in Fig. 1 1-5. The upper mode is one in which the atoms vibrate against each 
other and has a wavelength of 2a. The lower one is one in which a long row of atoms 
moves smoothly together to produce a long-wavelength mode of low frequency. It is 
the long-wavelength modes that Debye was able to treat in a clever manner. 

Debye reasoned that those normal modes whose wavelengths are long compared to 
the atomic spacing do not depend upon the detailed atomic character of the solid and 
could be calculated by assuming that the crystal is a continuous elastic body. The 
approximation of the Debye theory is that it treats all the normal frequencies from this 
point of view. 

The distribution of frequencies that can be set up in a solid body is calculated in 
almost the same way that we calculated the set of standing waves in a blackbody cavity. 



Figure 1 1-5. Two types of normal modes in a one-dimensional crystal. The upper one is a high-frequency 
mode and the bottom is a low-frequency mode. 
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The first part of Section 10-5 can be read independently of the rest of the chapter. We 
state there that the imaginary part of 

w(r, 0 = Ae iik r - (at) (11-15) 

represents a wave of amplitude A traveling through a medium in the direction of k 
and with frequency o = 2nv. The quantity k is called the wave vector, and its magni- 
tude is 27i/ vl. The velocity of this wave is given by v = co/k = vL A standing wave can 
be obtained by superimposing two waves traveling in opposite directions. This gives 

u = 2 Ae lk r cos cot 

In order that this be a standing wave, we require that the imaginary part of this vanish 
at the edges of the crystal. If we assume for simplicity that the crystal is a cube of length 
L, this boundary condition gives that k x L = n x n, k y L = n y n, and k z L = n z n , where 
k x9 and so on, are the components of k, and n X9 n yy and n z are positive integers. In 
vector notation, we have 


k = 



( 11 - 16 ) 


The frequency v depends upon only the magnitude of k (through the relation 
a> = vk), which is given by 


k 2 = 



(n x 2 + n* 


+o 


(11-17) 


The number of standing waves with wave number between k and k 4- dk is found 
by the same method that we used to find the number of translational energy states 
between s and e + de in Section 1-3. Using Eq. (1 1-17), the number of standing waves 
with the wave vector of magnitude less than k is [c/ Eq. (10-84)] 




l?k 3 

6n 2 


Vk 3 
6n 2 


and the number between k and k + dk is [cf. Eq. (10-85)] 


d < D 

co(k) dk = — dk = 
dk 


Vk 2 dk 
2n 2 


(11-18) 


We can convert this into g(v) dv by using the relation v = v/X = vkjln: 

. 47iKv 2 

g(v) dv = 3 — dv (1 1-1 9) 

This is almost the desired result. We must recognize that there are two kinds of 
waves that can propagate through a continuous medium. These are transverse waves, 
in which the medium vibrates perpendicular to the direction of propagation (the 
direction of k), and longitudinal waves, in which the medium vibrates in the same 
direction as the wave is propagated. Since it is possible to draw two independent vectors 
perpendicular to k and only one parallel to k, there are two transverse waves and 
one longitudinal wave. The three of these contribute to #(v), and we finally have the 
complete expression for the Debye approximation to #(v), namely. 


2 1 
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In this expression, v t and v t are the transverse and longitudinal velocities, respectively. 
It is conventional to introduce a kind of average velocity by means of 


3 ” . 3 + 3 


( 11 - 21 ) 


so that Eq. (11-20) can be written in the form 


g(v) dv = 


\2nV 


v 2 dv 


( 11 - 22 ) 


This expression is exact in the limit of low frequencies or long wavelengths, where 
the atomic nature of the solid is not important, and the crystal can, in fact, be treated 
as a continuous elastic body. The Debye theory uses Eq. (1 1-22) for all the normal 
frequencies, however. The total number of normal frequencies is 3N, and so Debye 
defined a maximum frequency v D such that the integral of g(v) dv from 0 to v D equals 
3 N. Thus 


f #(v) dv = 3 N 

J o 

which, when Eq. (1 1-22) is used for g(v) 9 gives 
/ 3N \ 1/3 


/3 N\ l/ * 

Vd= (w) v ° 


(11-23) 


(11-24) 


The frequency v D is called the Debye frequency. In terms of v D , the distribution 
function g(v) dv is 

9N 

g(v) dv = — 3 v 2 dv 0 < v < 


V > V, 


(11-25) 


This summarizes the Debye theory of crystals. 

We can now substitute Eq. (11-25) for g(v) dv into Eqs. (11-7) through (11-10) to 
calculate the thermodynamic properties of a crystal according to the Debye theory. 
The most interesting thermodynamic function is the heat capacity C v , given by 
Eq. (11-10) with Eq. (11-25) for ^(v) dv: 


Cy = 9 Nk 


T \ 3 r 0 D/ T 




(«* - D : 


(11-26) 


where we have let x = hv/kT and have defined the Debye temperature by 
° D ~ k 


(11-27) 


The integral in Eq. (1 1-26) cannot be evaluated in terms of simple functions and must 
be evaluated numerically. Note that the integral is a function of only the upper limit 
of the integral, that is, a function of &JT. It is customary to define a function Z)(T/©d) 
by 

/ T \ ,/T\ 3 f e ‘>/ T xV 

D (d ' 3 k) 1 (^TrP dx 


(11-28) 
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so that the heat capacity is 

C v = 3 NkD^^ (11-29) 

The function D(T/G d ) is called the Debye function. It is a well-tabulated function 
of T /& d . (See Appendix C.) Figure 11-6 is from Debye’s original paper and shows 
two comparisons of Eq. (1 1-29) with experimental data. It can be seen that the agree- 
ment is very good. The values of © D are those which give the best overall fit to the 
data. Table 1 1-1 gives the Debye temperatures for many monatomic solids. Note that 
most of these values are of the order of a few hundred degrees Kelvin. 

Table 1 1-1 . The Debye temperature of various monatomic solids 


solid 

0 O ( C K) 

solid 

©dCK) 

Na 

150 

Fe 

420 

K 

100 

Co 

385 

Cu 

315 

Ni 

375 

Ag 

215 

AI 

390 

Au 

170 

Ge 

290 

Be 

1000 

Sn 

260 

Mg 

290 

Pb 

88 

Zn 

250 

Pt 

225 

Cd 

172 

C (diam) 

1860 




Figure 11-6. Observed values and curves calculated on the Debye model for the heat capacity of alu- 
minum and copper, taking © D — 396 K and 309°K, respectively. (After P. Debye, Arm. Phy- 
sik , 39,789, 1912. From C. Kittel, Solid State Physics , 2nd ed.. New York: Wiley, 1956.) 
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Although Eq. (11-29) must be evaluated numerically for arbitrary values of T/Q D , 
it is easy to investigate its high- and low-temperature behavior. At high temperatures, 
the upper limit in the integral in Eq. (1 1-28) becomes very small. Hence the range of x 
is very small and it is legitimate to expand the integrand to get 


.© d /t x V 

o (e* - iy 


.©d/T *4(1 + X + ■ • •) 

o (1 + x + - 1) 




Therefore D(T/G d )-+ 1, and Eq. (11-29) for C v becomes 


C v -► 3 Nk = 3 R = 6 cal/deg-mole 


(11-30) 


which is the classical limiting law of Dulong and Petit. 

The low-temperature limit of C v is more interesting. This can be obtained by letting 
the upper limit of the integral in D(T/G d ) go to infinity. Then 



The integral here is standard and equals 47t 4 /l 5. (See Problem 1 1-6.) The low-tempera- 
ture limit of C v then is 



(11—31 ) 


which is the famous T 3 -law. This was the great triumph of the Debye theory. Although 
it is not obvious from Fig. 1 1-4, the Einstein heat capacity curve falls much too rapidly 
as T -► 0, and the agreement at low temperatures is very poor. This is more readily 
seen in Table 11-2, where the Einstein and Debye theories are compared to experi- 
mental data for silver. It can be seen that although both theories agree for tempera- 
tures greater than approximately 100°K, only the Debye theory is able to be used at 
lower temperature. 

There are several important features of the Debye theory. The one that we have just 
discussed in some detail is that it predicts a T 3 -law for the low-temperature heat 
capacity. Another is that it predicts a law of corresponding states for the heat capacity. 
Equation (11-29) clearly shows that if C v is plotted versus T/G D , all substances will 
lie on the one curve. Another way of saying this is that C v is a universal function for 
all substances, determined by one parameter ©^ in the form of T/Q D . Figure 11-7 
shows the heat capacity data for a number of substances plotted on the same graph of 
C v versus T;Q D . Note that the Debye curve fits all the points over the entire tempera- 
ture range. 

An interesting consequence of Debye’s approach is that it is possible to calculate 
© D in terms of the elastic constants of the solid. We shall not prove it here, but it 
should be clear that such a thing is possible since the Debye theory is based upon 
treating a crystal as a continuous elastic body. The elastic constants of a body are 
quantities such as the compressibility and Young’s modulus. Table 11-3 compares 
the Debye temperatures determined by fitting heat capacity data with those calculated 
from the elastic properties of the solid. The agreement, although not perfect, is quite 
good. 
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Table 1 1-2. Heat capacity of silver at different temperatures 


temperatures 

("K) 

C v (obs.) 

(cal/mole-deg) 

C v calculated 

Einstein Debye 

1.35 

0.000254 

8.76 x 10“*® 


2 

0.000626 

1.39 x 10- 32 

— 

3 

0.00157 

6.16 x 10- 20 

— 

4 

0.00303 

5.92 x 10 -15 


5 

0.00509 

1.62 X 10““ 

— 

6 

0.00891 

3.24 x 10- 9 

— 

7 

0.0151 

1.30 x 10- 7 

0.0172 

8 

0.0236 

2.00 x 10- 6 

0.0257 

10 

0.0475 

1.27 X 10-* 

0.0502 

12 

0.0830 

0.0010 

0.0870 

14 

0.1336 

0.0052 

0.137 

16 

0.2020 

0.0180 

0.207 

20 

0.3995 

0.0945 

0.394 

28.56 

1.027 

0.579 

1.014 

36.16 

1.694 

1.252 

1.69 

47.09 

2.582 

2.272 

2.60 

55.88 

3.186 

2.946 

3.22 

65.19 

3.673 

3.521 

3.73 

74.56 

4.039 

3.976 

4.13 

83.91 

4.326 

4.309 

4.45 

103.14 

4.797 

4.795 

4.86 

124.20 

5.084 

5.124 

5.17 

144.38 

5.373 

5.323 

5.37 

166.78 

5.463 

5.476 

5.51 

190.17 

5.578 

5.581 

5.61 

205.30 

5.605 

5.633 

5.66 


Source: C. Kittel, Solid State Physics, 2nd ed. Mew York: Wiley, 1956. 

One cannot expect perfect agreement in Table 1 1-3 since the Debye theory is, of 
course, an approximate theory. In fact, as more experimental data became available 
in the 1920s, the Debye theory was subjected to more and more severe tests, and 
discrepancies began to appear. For example, one can calculate © D at any particular 
temperature from Eq. (11-29) if the experimental heat capacity of that temperature 
is known. If the Debye theory were exact, the value of © D obtained would be inde- 
pendent of the temperature used to calculate it. However, it turns out that the value 
1.0 


0.8 


0.6 

C v f3R 

0.4 


0.2 


0 0.4 0.8 1.2 1.6 

t;q d 

Figure 11-7. An illustration of the law of corresponding states predicted by the Debye theory. (From 
F. Mandl, Statistical Physics. New York: Wiley, 1971). 
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Figure 11-8. A typical result for the Debye temperature as function of T for a monatomic solid. 


Table 1 1-3. Comparison of Debye temperatures determined from elastic constants or heat capacity 
measurements 


substance 

© 

(elastic) 

°K 

© 

(heat capacity) 

°K 

A1 

399 

396 

Cu 

329 

313 

Ag 

212 

220 

Au 

166 

186 

Cd 

168 

164 

Sn 

185 

165 

Pb 

72 

86 

Bi 

111 

111 

Pt 

226 

220 


of © D does depend upon the temperature at which it is evaluated. This can be shown 
most clearly on a © D versus T plot, where the value of © D calculated at some tem- 
perature is plotted against the temperature. Such a © D -T plot is shown in Fig. 11-8. 
If the Debye theory were exact, © D versus T would be a horizontal straight line, and so 
a deviation from such a straight line indicates a failing of the Debye theory. In the 
main, however, these deviations do not exceed 10 percent in most cases, and so we 
can say that the Debye theory is a successful theory of the thermodynamic properties 
of crystal lattices. In the next section we shall present the basic ideas of a more rigorous 
approach to the dynamical properties of lattices, which has been highly developed 
since the 1930s and constitutes the presently active field of lattice dynamics. 


11-4 INTRODUCTION TO LATTICE DYNAMICS 

In this section we shall introduce the basic ideas of lattice dynamics by calculating 
the frequency distribution of two types of one-dimensional lattices: one where all the 
masses are the same (a one-dimensional model for an elemental crystal such as Zn) 
and one with two alternating different masses (a one-dimensional model of NaCl, say). 
These two models have quite different vibrational spectra, both of which are qualita- 
tively observed in real crystals. At the end of the section, we shall simply state the types 
of results found in two and three dimensions. 

The Hamiltonian for the first case is (see Fig. 11-2) 

N m N f 

j — 1 2 J= 2 2 


(11-32) 
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where we are using / here for the force constants. The equations of motion 
corresponding to this are (see Problem 11-14) 

mlj =M J+ i + Zj-i - 2 Zj) (11-33) 

This set of equations represents a set of coupled harmonic oscillators. We assume that 
the time dependence is harmonic and let 

= e^yj (11-34) 

where co = 2nv 9 and y } is independent of time. Substituting this into Eq. (1 1-33) gives 

-mvp-yj =f(y j+l + j>;-i - 2yj) (1 1-35) 

This type of equation is called a difference equation (as opposed to a differential 
equation); it is a linear difference equation with constant coefficients. Linear difference 
equations with constant coefficients are solved by y } = A J (compare to y(x) = e nx as 
solutions to linear differential equations with constant coefficients). A little experience 
with equations of this type would suggest letting A be e l<t> . Substituting this into 
Eq. (11-35) gives 

—mco 2 = f(e i4> + e~ i4> — 2) 

= /(2 cos (f> —2) 


or 


CO 




(11-36) 


If we note that the maximum value that co 2 can have is 4 fjm (since the maximum value 
that sin 2 (cj) /2) can have is 1), then we can write 


co = co r 


s Ki) 


(11-37) 


The solution to Eq. (11-33) is then 
{?/*) = e* M + J » 

where cf> is as yet undetermined. 

Since this functional form for £j(t) repeats for every A / = 2n/<£, there is a wave- 
length A equal to a Aj = 2na/(}) 9 where a is the lattice spacing of the one-dimensional 
chain. From this, <t> is given by 


<t> = 


2na 

~ 


= ka 


(11-38) 


where k , which equals 2 n/A, is the wave vector of the motion of the chain. We see 
then that 

Z.(i) = e iUka+m) (11-39) 

represents a wave of wavelength 2njk and frequency co traveling along the chain. From 
the De Broglie relation, hk is the momentum of this wave, or in the language of the 
next section, hk is the momentum of a phonon associated with this frequency. 
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Figure . 11-9 The dispersion curve for a one-dimensional monatomic lattice. [Eq. (11-40)]. The straight 
lines correspond to a continuous string. 


The relation between the frequency and the wave vector is called a dispersion curve. 
The dispersion curve for our simple one-dimensional lattice is [cf. Eqs. (1 1-37) and 
(11-38)] 

" = <l>max|sin^yj| (11-40) 

This is plotted in Fig. 1 1-9. In the limit of small values of ka (wavelength long com- 
pared to the lattice spacing), we have essentially a continuous chain as in the Debye 
theory, and the dispersion curve is 

ka 

co = co m&x — (11-41) 

which we write as 


co „ 

- = Av = = constant velocity 

/c z 


(11-42) 


Typical lattice spacings are 0(10“ 8 cm), and typical frequencies are 0(1 0 1 3 sec -1 ), 
giving O(10 5 cm/sec) for the velocity of the wave. This is the order of magnitude for 
the velocity of sound in solids. 

Note that, in general, however, Av (or co/k) is not a constant. From Eq. (11-40), 



(11-43) 


which shows that the velocity is, in fact, a function of k or A. Waves actually have two 
types of velocities associated with them, and the velocity c(k) defined in Eq. (1 1-43) 
is called the phase velocity. It is the fact that different wavelengths travel with different 
velocities, which leads to the dispersion of waves as they pass through a prism, and so 
co(k) is called a dispersion curve. Dispersion curves can be determined experimentally 
by inelastic neutron scattering measurements. 

An important property of Eq. (1 1-39) for £/t) is that the substitution k-+k„ = 
k -I- 2nn/a with n = ±1, ±2, . . . leaves £/t) unchanged. Furthermore, this substitution 
leaves Eq. (11-40) for co unchanged. In other words, there is no physical difference 
between states corresponding to wave vectors k or k + 2nn/a. In order to obtain a 
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unique relationship between the state of vibration of the lattice and the wave vector 
fc, k must be restricted to a range of values 2n/a. Usually one chooses 


n , n 
--<fc <- 
a a 


We now show how to express the thermodynamic properties of the lattice in terms 
of the dispersion curve. Before doing this, however, we must consider the boundary 
conditions in our lattice. We shall use periodic boundary conditions , namely, that 
= One way of thinking about these boundary conditions is to picture 

the linear chain of N atoms to be bent around into a circle. Clearly, if the chain is long 
enough, joining its two ends has a negligible effect on the thermodynamic properties 
of the chain. Applying these boundary conditions to the i/t) [Eq. (11-39)] gives 
cxp(iNka) = 1, which implies that k = 2nj/Na 9 where j is an integer. Now because k 
is restricted to lie between ±n/a y the possible values for j are ±1, ±2, .. ., ±N/ 2. 
The thermodynamic energy of the crystal (without the zero-point energy) is then 


e=y — ^ — 

j exp (JShcoj) - 1 

Na r n/a hco(k) dk 
7i expQ 3tico(k)) — 1 


(11-44) 


where the summation over the possible wave vectors defined above has been approxi- 
mated by an integral and the fact that co(k) = co(\k\) has been used. We see, then, that 
if <o(k), that is, the dispersion curve of the lattice, is known, we can calculate E and 
other thermodynamic properties of the lattice. 

This integral over k can be converted to an integral over co itself by means of the 
dispersion curve. 



2 dco 

a(oL* - co 2 ) 1 ' 2 


(11-45) 


Substituting this into Eq. (1 1-44) gives 


2N r Wm «« tlCO dco 

n J 0 [expifthco) - l][ft) 2 ax - a) 2 ] 1/2 


(11-46) 


If we compare this to Eq. (1 1-9) (without the zero-point energy ftv/2), we see that 




2 N 1 

V (vi x - v 2 )*/ 2 


(11-47) 


Problem 1 1-33 is involved with showing that a one-dimensional Debye approxima- 
tion agrees with Eq. (1 1-47) as v 0. 

In one dimension, the distribution of frequencies is related to the dispersion curve 
[cf. Eq. (11-47) and Eq. (11-45)] by 


' n dv/dk 

Notice that ^(v) has singularities at the points where dv/dk equals zero. For a con- 
tinuum, dv/dk = velocity = constant. In general, however, dv/dk is not constant. The 
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derivative dvjdk is called the group velocity of the wave and physically represents 
the rate-of-energy transmission of the wave. For a continuum, the group velocity and 
phase velocity [Eq. (1 1-43)] are both equal to the same constant value. 

Now let us consider a one-dimensional lattice with two alternating kinds of atoms 
with masses m l and m 2 . This represents a one-dimensional analog of a crystal like 
sodium chloride. The Hamiltonian for this lattice is 

H = 5 ^ |~ 2 " + ~2 - 1 J + 2 ““ %2j- 1) 2 + (%2j + 1 “ £ 2 j) 2 } 


In this case one gets two sets of equations of motion, one for the masses m l and one 
for the masses m 2 , which eventually yield (see Problem 1 1-34) 


co 


2 


where 



4 m l m 2 sin 2 ^\ 1/2 \ 
(™i + m 2 ) 2 ) ) 


co 0 


2 


f 


(11-48) 


(11-49) 


In these equations, $ = multiple of n/N, and n is the reduced mass of m l and m 2 . 

According to which sign is chosen, the dispersion curve of Eq. (1 1-48) yields two 
branches: a high-frequency branch called the optical branch and a low-frequency 
branch called the acoustical branch. Figure ll-10(a) shows these two dispersion 
curves, and Fig. ll-10(b) shows the corresponding frequency distribution #(v). In 
the normal modes belonging to the acoustical branch, neighboring atoms are displaced 
in the same direction (as in the lower curve in Fig. 1 1-5) to produce a long wavelength 
mode. In the optical branch, neighboring atoms are displaced in opposite directions 
(as in the upper curve in Fig. 1 1-5). 

If the two masses of our lattice are ions of opposite sign (such as NaCl), the vibra- 
tional motion in the optical modes produces oscillating dipole moments. An oscillating 
dipole moment leads to an absorption of infrared radiation. Figure 11-11 shows the 
infrared spectrum of NaCl. Furthermore, the larger the reduced mass, the lower 




(a) (b) 

Figure 1 1-10. (a) The angular frequency for a diatomic chain as a function of the phase angle <f>. 
(b) Density of normal vibrations of a linear chain with m,/fw 2 = 3. 
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Figure H-ll. The absorption of infrared radiation through a thin film of sodium chloride. (From C. 

Kittel, Solid State Physics , 3rd ed. y New York: Wiley, 1967; after R. B. Barnes, Z. 

Physik. 9 75, 723, 1932.) 

the frequency of the optical modes, and hence the longer the wavelengths at which 
the infrared absorption occurs. Figure 11-12 shows the onset of absorption in 
various alkali halides. Evidently, if we wish to get good transmission in the infrared, 
we must use a crystal in which the ions are as heavy as possible. Such considerations 
are important when designing prisms for infrared spectrometers. 

The exact lattice dynamic calculations of two- and three-dimensional lattices 
proceeds in much the same way that we did in the previous section, but the actual 
calculations are much more difficult. Figure 11-13 shows two experimentally deter- 
mined vibrational spectra to give an idea of the complexity of the spectra that are 
obtained for three-dimensional lattices. There have been several exact or near-exact 
calculations of such spectra, and there is now an extensive literature on lattice dynamics 
calculations (see “Additional Reading”). 

The last two sections of this chapter deal with two special topics, namely, the con- 
cept of phonons (Section 11-5) and the concept of defects in crystal lattices 
(Section 11-6). 



0 5 10 15 20 25 30 35 


wavelength (microns) 

Figure 11-12. The onset of the absorption of infrared radiation by alkali halide crystals, showing that 
those alkali halide ion pairs with the heavier reduced masses begin to absorb at longer 
wavelengths. (From C. Kittel, Solid State Physics , 3rd ed. y New York: Wiley, 1967.) 
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(a) (b) 

Figure 11-13. The experimental frequency spectrum of (a) aluminum and (b) iron. (From M. 
Blackman in Encyclopedia of Physics , vol.VII, pt. 1, ed. by S. Fliigge. Berlin: Springer- Verlag, 1955.) 


11-5 PHONONS 

We have seen in Section 1 1-1 that if a crystal has N atoms, it has 3 N normal coordi- 
nates, each with its own characteristic frequency v l9 v 2 , . . . , v 3N . The total energy of 
the crystal is 


3 N 

E({nj}) = £ hv/nj + i) 

j= i 


3 « 3 N 


= E il ¥ J + E 

j= 1 j = 1 



3 N 

= £ hvjtij + 

j= 1 


Eo 


(11-50) 


(11-51) 


where we have set E 0 equal to the total zero-point vibrational energy. Since E is a 
sum of terms, this expression for E can be interpreted as the energy of a system of 
independent particles, which occupy the states 1, 2, 3 N with corresponding 

energies hv u hv 2 , - . . , hv 3N with n x particles in the first state, n 2 in the second, and 
so on. Note that the set of numbers {nj} 9 the occupation numbers of the 3 N states, 
completely specifies the state of the system. Since it is only the number of “ particles ” 
in each state that specifies the system, we can consider these 44 particles ” to be indis- 
tinguishable, and furthermore, since there is no restriction on the numbers nj , that is, 
nj = 0, 1, . . . , 3 N, these “particles” are bosons. 

This is actually a very useful interpretation, since it gives us a system of non- 
interacting bosons, and we can use the Bose-Einstein formulas of Chapter 4 or 
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Chapter 10. As far as the vibrations of the lattice are concerned then, we shall treat it 
as if it were an ideal Bose-Einstein gas. The “particles” we have invented here are 
examples of the quasi-particles we mentioned at the beginning of Chapter 4 and are 
called phonons. Phonons are essentially quanta of lattice vibrations, just as photons 
are quanta of electromagnetic vibrations. Since lattice vibrations are closely related 
to sound waves passing through the crystals, phonons can be thought of as quanta of 
sound waves. 

We can directly apply the basic Bose-Einstein equations of Chapter 4 or even the 
formulas developed in Section 10-5 for a photon gas. For instance, the equation for 
the average occupation of the yth state is [Eq. (4-26)] 


hr*'* 1 

f- Ye~ Tzi ' A~V E ' - 1 


(11-52) 


As we saw in Chapter 10, these quantum statistical equations are awkward to use 
because of the presence of 2, but in the case of a phonon gas, we can evaluate A easily. 
The reason for this is that the number of phonons is, in fact, not fixed, since clearly it 
is possible to have a number of sets {nj} in which £({«,-}) is fixed but n = Y,j n j is 
different. Thermodynamically, the system is characterized by E and V only. A system 
of phonons, then, is mathematically identical to a photon gas (photons obey Bose- 
Einstein statistics and their number is not conserved), and in Chapter 10 we showed 
that p = 0 or A = 1 for this case. In Section 10-5 we gave a simple thermodynamic 
proof that p = 0. In brief, the phonons are likened to a chemical equilibrium of the 
type nA mA y where n and m are integers. Since the number of phonons is not 
conserved, n / m. The condition for equilibrium (Chapter 9) is that ( m — n)p = 0, 
and thus we have p = 0 and A = 1. 

With A set equal to 1, then we have 


nj = 


_ l 


and 


E = 


3 N 


Y rijhVj + E 0 = 
j= i 


3^ /jy . 

i^ +E " 
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If we introduce g{y\ we can write 


E — Eq + 


f°° 9( v )hv dv 

J 0 e * h ' - 1 


(11-55) 


which is the same as Eq. (11-9). Equation (11-54) is the same as Eq. (11-44) without 
the zero-point energy. The derivative of this gives Eq. (10-10) for the heat capacity. 
We can thus derive all of the results of Sections 1 1-2 and 1 1-3 by treating the lattice 
vibrations of a crystal as a gas of noninteracting phonons. 

Actually, the concept of phonons is much more useful than this. For instance, 
phonons can be assigned a momentum (just as a photon can), which must be con- 
served in collisions. The collision of the phonons and the electrons in a metal lead to 
the electrical resistance of a metal. The inelastic scattering of phonons with photons 
is known as Brillouin scattering. The vibrational frequency spectrum of a crystal can 
be determined experimentally by inelastic neutron scattering, which is treated theoreti- 
cally by phonon-neutron scattering. 
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11-6 POINT DEFECTS IN SOLIDS 


Up to now we have assumed that every atom or ion of a crystal is situated at a 
lattice site and that every lattice site is occupied by one and only one particle. Such a 
perfect periodic arrangement is called a perfect crystal. We shall show below, how- 
ever, that such a perfect arrangement is thermodynamically unattainable. Any devia- 
tion from such perfect behavior is called an imperfection or a defect. There are quite 
a variety of defects that exist in crystals, but in this section we shall study only the 
most common point defects, namely, vacant lattice sites and interstitial atoms (extra 
atoms not at lattice sites). Although it will turn out that the concentration of such 
defects will be fairly small, they nevertheless have a profound effect on the properties 
of crystals. The conductivity of some semiconductors is due entirely to trace amounts 
of chemical impurities. The color of many crystals is due to imperfections, and the 
mechanical and elastic properties of crystals depend strongly on the number and type 
of defects present. The diffusion of atoms through solids is another property that is 
dependent upon defects. 

The simplest type of defect is a lattice vacancy. Such a missing atom or ion is known 
as a Schottky defect . A Schottky defect can be created by transferring an atom or 
ion from the body of the crystal to its surface. Although it requires energy to do this, 
there is an increase of entropy. The final equilibrium concentration is found by mini- 
mizing the free energy. If we assume that it takes an energy e v to bring an atom from 
an interior lattice site to a surface lattice site and also assume that the concentration 
of defects is small enough to consider them to be independent, then we can write 


A(n) = E-TS 


= m v — kT In 


N\ 

nl(N-n)l 


(11-56) 


The quantity n is the number of vacancies, and the combinatorial factor is just the 
number of ways of distributing n vacancies over N sites. We now minimize A(n) with 
respect to n to find the number of vacancies expected. Setting ( dA/dri) T equal to zero 
and solving for n give 

n ~ Ne~ Bv/kT (11-57) 


where we have neglected n compared to N. If e v is 1 eV and T — 300°K, then 
n/N « 10" 17 At 1000°K, n/N & 10" 5 (see Problem 11-19). (In ionic crystals such as 
NaCl, it is usually favorable to form roughly equal numbers of positive and negative 
defects. This keeps the crystal electrostatically neutral on a local scale.) 

The other common type of defect that we shall discuss here is a Frenkel defect , in 
which an atom is displaced from a lattice position to an interstitial position. If we let 
Ej be the energy it takes to do this, let N be the number of lattice sites, and let N' 
be the number of possible interstitial sites, then 


\ N\ N'l 

A® ~ m '- kT ' „|(N'-„)I 


(11-58) 


In this case the combinatorial is a product of the number of ways of choosing n out 
of N lattice sites and the number of ways of distributing the n chosen atoms over the 
N' available interstitial sites. Minimizing A(n) gives (see Problem 11-20) 

n & (NN') ll2 e~ ei,2kT 


(11-59) 
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It turns out that the most common type of point defect in alkali halides are Schottky 
defects, and the most common defect in silver halides are Frenkel defects. This is 
probably due to the fact that silver ions are smaller than alkali ions and so can fit 
into interstitial positions more easily. It is possible to determine if one type of defect 
is predominant by careful measurements of the density. The formation of Schottky 
defects lowers the density of the crystal since the volume is increased with no change 
in mass. On the other hand, the formation of Frenkel defects does not change the 
volume, and so there is no change in density. It is also possible to use ionic conduc- 
tivity measurements to differentiate between Schottky and Frenkel defects. 

Lattice vacancies in controlled concentrations can be produced by the addition of 
divalent ions. For instance, if a crystal of KC1 is grown with controlled amounts of 
CaCl 2 , the Ca 2+ enters the lattice at a normal K + site, and the two Cl - enter at 
two normal Cl" sites. The net result of this is the production of vacant positive ion 
sites. This is shown in Fig. 11-14. The formation of these lattice vacancies can be 
observed by measuring the density of CaCl 2 -KCl mixtures. The volume of such a 
mixture is actually larger than the volume of the separate components. 

We said earlier in this section that the rate of diffusion in solids is greatly affected 
by the presence of defects. As an atom diffuses through a crystal, it must surmount a 
series of energy barriers presented by its neighbors as it moves from lattice site to 
lattice site or from interstitial position to interstitial position. Let us consider interstitial 
diffusion of impurities. If this barrier height is e, then exp (— e/kT) can be thought of as 
the fraction of time that the atom will have an energy exceeding e. If v is the frequency 
with which the diffusing atom vibrates around its interstitial position, then the proba- 
bility per unit time that the atom will be able to pass over the barrier is 

p«ve*~ e/ * T ( 11 - 60 ) 

Now consider two parallel planes of impurity atoms in interstitial sites. The planes are 
separated by the lattice constant a. There will be c impurity atoms on one plane and 
c + a(dc/dx) on the other. The net number of atoms crossing between these planes per 
unit time is pa(dcjdx ). If n is the concentration of impurity atoms, then c = an. The 
diffusion flux is then 

7 dn 

J pa di (11_61) 

Remember that a flux of a quantity ¥ is the rate of flow of ¥ through a unit area of 
surface per second. The minus sign occurs because the direction of the diffusion is 



Figure 11-14. Production of a lattice vacancy by the solution of CaCl 2 in KC1: to ensure electrical 
neutrality a positive ion vacancy is introduced into the lattice with each divalent cation 
Ca 2+ . The two Cl - ions of CaCl 2 enter normal negative ion sites. (From C. Kittel, 
Solid State Physics , 3rd ed. New York: Wiley, 1967.) 
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opposed to the concentration gradient. We compare this to Fick’s law, which simply 
states that a diffusion flux is proportional to the concentration gradient of the diffusing 
substance, that is, 

__ 

j= -D— (11-62) 

dx 

where D is the diffusion constant. We see then that 

D = pa 2 « \cFe~ z!kT (11-63) 

Figure 11-15 shows the experimentally determined temperature dependence of the 
diffusion coefficient of carbon in iron. If we let v be 10 14 sec -1 , a be 3 x 10 -8 cm, and 
e be 1 eV, then D » 10 -18 cm 2 /sec at 300°K and 10 -6 cm 2 /sec at 1000°K. (See 
Problem 11-23.) 

Another property that is much affected by defects is the color of slightly impure 
(doped) alkali halide crystals. Pure alkali halides are transparent in the visible region, 
but if a sodium chloride crystal is heated in sodium vapor, it becomes yellow. Similarly, 
if KC1 is heated in potassium vapor, it takes on a magenta color. These same effects 
can be produced by X-ray, neutron, or electron bombardment or by electrolysis. The 
color of the doped crystals is due to defects which absorb light and are called color 
centers . A number of different kinds of color centers have been discovered, but here 
we shall mention only the first discovered and perhaps the simplest, namely, an 
F center. The name comes from the German word for color, fctrbe. F centers have been 
identified by electron spin resonance to be electrons bound to a negative ion vacancy. 
This is consistent with the fact that they can be produced by heating the pure crystal 
in an excess of the metal vapor. When this is done, the metal atoms of the vapor are 
incorporated into the crystal lattice. The metal atom loses its electron, and the positive 
ion takes up a lattice site while the electron associates itself with a vacant negative 
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Figure 11-15. Diffusion coefficient of carbon in iron. (From C. Kittel, Solid State Physics , 3rd ed. 
New York: Wiley, 1967.) 
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ion lattice site which is lacking an anion. A vacant negative ion lattice site has an 
effective positive charge, and so the electron behaves somewhat like the electron bound 
to a nucleus and can absorb light. The model is consistent with a number of experi- 
mental facts. For instance, Fband absorption is characteristic of the crystal and not of 
the alkali metal used in the vapor; that is, NaCl will become yellow whether heated in 
sodium vapor or any other alkali metal vapor. Furthermore, the absorption corre- 
sponds quantitatively to the amount of excess metal, and it has been determined that 
colored alkali halide crystals are less dense than pure alkali halide crystals. 
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PROBLEMS 

11-1. The difference between the constant pressure and constant volume heat capacities is 



In terms of the volume expansion coefficient a v = (1 IV)(dVl&T) p and the isothermal compres- 
sibility k = — (1 IV)(dVldp) T9 this difference is 

TV 

C p — Cy — OCy 

K 

which is often rewritten as 
Cp — Cy{ 1 ycty!*) 
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where y = a v V/kCv is practically independent of temperature. This is called the Griineisen 
constant. Calculate the Griineisen constant given that ay = 6.22 x 10“ 5 deg -1 and k = 
12.3 x 10 “ 12 cm 2 -dyne _1 at room temperature. 

11-2. Find both the high- and low-temperature limiting forms of the heat capacity ac- 
cording to the Einstein model. 

11-3. Derive an expression for the Einstein specific heat for a two-dimensional crystal. 
11-4. Determine the various thermodynamic properties of an Einstein crystal. 

11-5. Derive Eq. (1 1-24). 

11-6. Prove that 

r 00 x*e* dx 4 tt a 

Jo (e* — l) 2 = H~ 

Hint: See Problem 1-63. 

11-7. Derive an expression for the heat capacity of a Debye crystal as a power series 
expansion in © D /7. 

11-8. Why is the heat capacity of diamond at room temperature far below its Dulong- 
Petit value? 

11-9. Prove that for a monatomic crystal that 

f °° [3Nk — Cv(T)] = E(0) 

Jo 

where E( 0) is the energy of the crystal at 0°K. Give a graphical interpretation of this equation. 

11-10. Starting with a two-dimensional wave equation, derive a Debye type equation for 
the heat capacity C v for a two-dimensional crystal, assuming that the transverse and longitu- 
dinal velocities are the same. 

11-11. Show that in a monatomic crystal, the high-temperature limiting form of the heat 
capacity depends only on the existence of a cutoff frequency and is given by the law of Dulong 
and Petit. That is, assume only that the distribution of frequencies 

g{v) =0 for v > 


11-12. The potential energy of the atoms of a solid of density NjV in their equilibrium 
positions is denoted by L/(0; N V). The normal frequencies of vibration of the atoms near 
their equilibrium positions are functions of the density vj(NIV ) where j = 1, 2, . . . , 3N — 6. 
It is a good approximation that 


d log v s 
a log v = ~ Y 


O' = 1. 2, , 3N — 6) 


for all frequencies. The constant y is called the Griineisen constant. Show that under this 
approximation, the pressure of the solid is given by 


P = 


dU E 
dV +Y V 


This equation of state is known as the Mie-Griineisen equation. 

11-13. A modification of the Debye theory was introduced by Bom, who proposed a 
different cutoff for the spectrum of vibrational modes. He proposed that the cutoff be made 
such that both the longitudinal and transverse modes have a common minimum wavelength. 
If we denote this common minimum wavelength by X m , then X m v lonB = Ci onB and X m v iTaM = 
ctrans. Equation (11-23) now becomes 


4t tV 



_ 2 _ 

c t 3 


v 2 dv -f- 



= 3 N 
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Show that this leads to the following expression for the specific heat: 

C-- *[*>(?)+ “$)] 

where D(x) is the Debye function 



e 2 z* dz 

(e 2 - l ) 2 


11-14. Derive Eq. (11-33) from (11-32). 

11-15. The Debye theory treats the crystal and as a continuum body, and hence the disper- 
sion relation for the phonons is a> = ck 9 where c is the speed of sound in the body. In a 
ferromagnetic solid at low temperatures, there exist quantized waves of magnetization called 
spin waves, and the dispersion relation for these type of waves is o> oc k 2 . Find the low- 
temperature heat capacity due to spin waves. 

11-16. Suppose that wavelike quasi-particles, having the dispersion relation cd = Ak n , 
exist in a solid and yield a specific heat when they are excited as thermal motion. Using the 
relation 


Jtcu k 

k expOS#^ — 1 
V r #a>(k) dk 

(2 tt) 3 J exp(0#a>(k)) — 1 


(three dimensions) 


show that the heat capacity is proportional to T 3,n at low temperatures. Note that for n = 1, 
that is, wjk = Xv = A — constant, one has the Debye theory. 

11-17. Consider a planar square lattice of identical atoms which vibrate perpendicular to 
the plane of the lattice. If we let u lm be the displacement of the atom in the /th column and 
mih row, show that the equation of motion is 


/ d 2 Uin\ 

ml “^7“ 1 =/[(«! + 1, in + Ui - i. * — 2l/, m ) + (l/,. n, + 1 + Wl, m - 1 — 2 U lm )] 

where m is the mass of an atom, and / is the force constant. By analogy with Eq. (11-39), 
assume a solution of the form 


u lm = exp[i(lk y a + mk x a + o)t)] 

where a is the nearest-neighbor lattice spacing. Show that the dispersion relation for this system 


2 / 

o) 2 — (2 — cos k x a — cos k y a) 

m 


For ka = (k x 2 + k y 2 ) 1,2 a where a 1, show that 


CD 


-© 


1/2 


11-18. Use the equations E = hv and p = h \ A to show that the group velocity for a free 
particle of mass m is pjm. 

11-19. Calculate the number of Schottky defects per mole of crystal at 300°K and 1000°K 
given that it takes 1.0 eV to bring an atom or ion from an interior lattice site to a surface 
lattice site. 
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11-20. Show that the number n of Frenkel defects in equilibrium in a crystal having N 
lattice points and N' possible interstitial positions is given by the equation 

e > — Ar.riog^ — J 

which, for n^N, N\ gives 
n ~ (AW')" 2 exp 

Here e t is the energy necessary to remove an atom from a lattice site to an interstitial position. 

11-21. (a) Assuming a simple coulombic interaction between positive and negative ion 
vacancies, calculate the binding energy between a pair of oppositely charged vacancies in 
NaCl. 

(b) If«i represents the number of single positive or negative ion vacancies and/z 2 the num- 
ber of pairs of vacancies, derive an expression for the Helmholtz free energy and show by 
minimizing this with respect to m and n 2 that 



where e is the binding energy of a pair of oppositely charged vacancies, and <f> is the energy 
required to produce a single positive and negative ion vacancy. 

11-22. Treat an F center as a free electron moving in the field of a point charge e in a 
medium of dielectric constant e = n 2 where n is the index of refraction. What is the Is — 2p 
energy difference of F centers in NaCl? 

11-23. If a sodium atom next to a vacancy has to move over a potential hill of 0.5 eV, and 
the atomic vibration frequency is 10 12 Hz, estimate the diffusion coefficient at room tempera- 
ture for radioactive sodium in normal sodium. Assume a lattice spacing of 4 A. 

11-24. Consider a closed container containing a small solid and its vapor in equilibrium 
at temperature T. Assume that the volume of the solid, v, is much less than the volume of the 
container, V. Let the partition function of the solid be of the form Q —qs{Tf\ and let there 
be N g molecules in the vapor phase. Show that the equilibrium condition is given by 



where q g is the partition function of a vapor phase molecule. Hint: Minimize the total free 
energy of the system, that is, A g (T y K, N g ) + A S (T, N s ), with respect to N g keeping N g + N, = 
N = constant. 

11-25. Using the result of Problem 11-24, show that the vapor pressure of an Einstein 
crystal is 


P 


, (2nmkTY l2 ( ©A 

=* r bH ( 2sinh 2f) 


3 

e -** kT 


where <f> is given by 1/(0; p) = —N s <f). 

11-26. The pV term in G = A +pV can be neglected for a condensed phase. Using this 
fact, the chemical potential of a Debye crystal can be well approximated by fx = A/N rather 
than the correct GjN. Show that the vapor pressure of a Debye crystal is given by 


[ (2irmkT\ 3t2 
\np = In [—p— ) & 


M 


</>(0)/2 + 9Xr© D /8 
kT 


)-?© 


Identify the quantity </>(0)/2 + 9K® D /%. 

11-27. Consider a gas in equilibrium with the surface of a solid. Some of the molecules of 
the gas will be adsorbed onto the surface, and the number adsorbed will be a function of the 
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pressure of the gas. A simple statistical mechanical model for this system is to picture the 
solid surface to be a two-dimensional lattice of M sites. Each of these sites can be either 
unoccupied or occupied by at most one of the molecules of the gas. Let the partition function 
of an unoccupied site be 1 and that of an occupied site be#(T). (We do not need to know #(70 
here.) Assuming that molecules adsorbed onto the lattice sites do not interact with each other, 
the partition function of N molecules adsorbed onto M sites is then 


Q(N 9 M 9 T) = 


Ml 

Nl(M-N)\ 


[g(T)] N 


The binomial coefficient accounts for the number of ways of distributing the N molecules 
over the M sites. By using the fact the adsorbed molecules are in equilibrium with the gas 
phase molecules (considered to be an ideal gas), derive an expression for the fractional cover- 
age, 6 = NjM, as a function of the pressure of the gas. Such an expression, that is, 6{p\ 
is called an adsorption isotherm, and this model gives the so-called Langmuir adsorption 
isotherm. 

11-28. The low-temperature constant volume heat capacity of many metals can be written 
in the form 


C v = yT+AT 3 

where the T 3 term is from the lattice vibrations and the linear term is due to the electrons. 
In the previous chapter it was shown that CV for a free electron gas is given by C v = 
7t 2 NIcTI2Tf , where T F is the Fermi temperature. Given that the Fermi temperatures of Na 
and Cu are 3.7 x 10 4 °K and 8.2 x 10 4O K, respectively, calculate y and compare it to the 
experimental values. 

11-29. Compare the contributions of the electrons and the lattice vibrations to the heat 
capacity of sodium at low temperatures. 

11-30. Show that the bulk modulus B = V(dP/dV) T of an electron gas at 0°K is B = 
iP=10Eo!9V. The valence electron density of potassium is about 1.40 x 10 22 cm -3 . Cal- 
culate B for potassium and compare with the experimental value of 3.66 x 10 10 dyne/cm 2 at 
4°K. 

11-31. Show that the entropy of a Debye crystal at low temperature is given by 


S = 


4ir*Nk 

5 



3 


11-32. The heat capacity of copper at 100°K is 3.85 cal/mole-deg. Using this information, 
calculate the value of © £ and © D for copper. Now calculate the heat capacity at 25°K and 
compare it to the experimental value of 0.23 cal/mole-deg. Which model gives better results 
at low temperatures? 

11-33. Show that giy) of a one-dimensional Debye crystal agrees with Eq. (1 1-47) as v -► 0. 

11-34. Derive Eq. (11-48). 



CHAPTER 12 


IMPERFECT GASES 


In the limit of low densities, all gases approach perfect-gas behavior, or in other 
words, they obey the well-known equation of state 

p = pkT (12-1) 

This equation was derived in Chapter 5 for a monatomic gas in which the intermolecu- 
Iar potential could be ignored. Physically, this means that the particles spend most of 
their time far away from each other and so do not “ feel ” any intermolecular potential. 
To see again how Eq. (12-1) arises from statistical thermodynamics, consider the 
classical canonical partition function of N monatomic particles contained in a volume 
V at temperature T : 

Q= NUP i S € ~ PHdPi •••** ( 12 - 2 ) 


Since H is of the form 


1 N 

H =T~ E (A - 2 + Py n 2 + Pzn) + U(x u y u . . . , Z N ) 

2m n = 1 


we can immediately integrate over the momenta to get 


Q 


1 (2nmkT\ 3N/2 r _, 
W! \ I? ) Z ” 


(12-3) 


where Z N is the configuration integral 

Z N = J J e~ Vr,lkT di t di 2 • • • dr^f (1 2-4) 

If we can neglect U N in the configuration integral, then Z N = and Q = q N /Nl where 
q(V, T) = (2nmkT /h 2 ) 3/2 V. The key point is that it is q being of the form f(T)V that 
leads directly to the ideal gas equation of state. Although we have discussed only the 
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case of monatomic gases here, the same result holds true for polyatomic gases. This 
is easily proved at the expense of introducing a number of angular integrations (cf. 
Chapters 6 and 8). 

As the density of a gas is increased, the particles are closer on the average, and the 
intermolecular potential becomes nonnegligible. Thus the configuration integral is 
no longer simply V N , and the ideal gas equation of state is not obtained as the equation 
of state of the gas. Of course, it is well known experimentally that real gases exhibit 
deviations from ideal gas behavior as the density is increased. A large number of 
empirical and semiempirical equations of state have been constructed to describe 
the deviations from the simple ideal gas law. The most fundamental of these, in the 
sense that it has the most sound theoretical foundation, is the so-called virial equation 
of state, originally proposed by Thiesen and developed by Kamerlingh-Onnes. The 
virial equation of state expresses the deviations from ideal behavior as an infinite power 
series in p : 

^=P + B 2 (T)p 2 + B 3 (T)p 3 + — (12-5) 

The quantities B 2 (T) y B 3 (T) y are called the second, third, virial coefficients, 
respectively, and depend only on the temperature and on the particular gas under 
consideration, but are independent of density or pressure. The primary goal of this 
chapter is to derive expressions for B 2y B 3y and so on, in terms of intermolecular 
potentials. 

We shall show in Section 12-1 that the yth virial coefficient can be calculated in 
terms of the interactions of j molecules in a volume V. This is proved most readily by 
means of the grand canonical partition function. Thus we shall show that the N - body 
problem of an imperfect gas can be reduced to a series of one-body, two-body, three- 
body problems, and so on. The initial deviations from ideality (up to 10 atm, say) 
rest in B 2 (T) y which we shall see is easy to calculate since it involves only two-body 
interactions. The derivation presented in Section 12-1 is valid for any one-component 
gas, including polyatomic quantum-mechanical gases whose intermolecular forces are 
not pair-wise additive. Then in Section 12-2 we shall specialize these results to a 
classical monatomic gas whose intermolecular potential is pairwise additive. The next 
two sections are somewhat detailed discussions of the experimental and theoretical 
second and third virial coefficients. Section 12-5 is devoted to the calculation of virial 
coefficients higher than the third; Section 12-6 discusses quantum corrections to the 
second virial coefficient; Section 12-7 discusses the law of corresponding states; and 
lastly. Section 12-8 is a general discussion. 

Before going on, however, it is helpful to consider Table 12-1, which gives the 
contribution of the first few terms of the virial expansion to p/pkT. The data are for 
argon at 25°C. The contributions of all the remaining terms are shown in the paren- 
theses. It can be seen that the second and third virial coefficients alone give most of 
p/pkT up to pressures approaching 100 atm. 

Table 1 2-1 . The contribution of the first few terms in the virial expansion of p/pkT for argon at 25°C 
p(atm) p/pkT 

1 + B 2 p + B 3 p 2 + remainder 

1 1 - 0.00064 + 0.00000 + • • • (+0.00000) 

10 1 - 0.00648 + 0.00020 + • • • (-0.00007) 

100 1 - 0.06754 + 0.02127 + • • • (-0.00036) 

1000 1 - 0.38404 + 0.68788 + • • • (+0.37232) 


Source : E. A. Mason and T. H. Spurling, The Virial Equation of State (New York: Pergamon, 1969). 
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12-1 THE VI RIAL EQUATION OF STATE 

FROM THE GRAND PARTITION FUNCTION 


“ The grand partition function is \cf. Eq. (3-15)] 

E(V T, m) = f Q(N, V, T)A w (12-6) 

N=0 

where A = exp(/?/i). When N = 0, the system has only one state with E = 0, and so 
Q(N = 0,V,T)=l. This allows us to write Eq. (12-6) as 

S(K, T, & = 1 + £ Q n (V, T)X n (1 2-7) 

Af=l 

where we have written Q N (V, T) for Q(N, V, T). The characteristic thermodynamic 
function associated with E is pV according to the relation 

pV = kT\n 2 (12-8) 


The average number of molecules in the system is given by [cf. Eq. (3-33)] 


N = kT 


i VH /V,T \ VA /V,T 


( 12 - 9 ) 


Thus we have the pressure and essentially the density in terms of S. The standard 
procedure to eliminate E between these two quantities is to obtain a power series for 
In E in some convenient parameter and then to eliminate this parameter between 
Eqs. (12-8) and (12-9). The most obvious choice for this expansion parameter is A, 
since we already have E as a power series in A in Eq. (12-7). It is more convenient, 
however, though not at all necessary, to define a new activity z, proportional to A, 
such that z -► p as p -* 0. By taking the limit A 0 in Eq. (12-9), we find that 


id In S\ 

N -i-nrh.r lQ ‘ ( ^ 0) 

Thus as A -> 0, the density p -► Ag x /K, and so we set z = XQJV. In terms of this new 
activity, then 

S(K, T, n) = \ + j[ (^r)* W (1 2-1 0) 

It is convenient to define a quantity Z N by 



It will turn out that the classical limit of the Z N defined here is just the configuration 
integral given in Eq. (12-4). With these definitions, Eq. (12-7) becomes 


_ i . £ Z*(V 9 T ) N 
“ =1 + I T7\ z 

N— i /V! 


( 12 - 12 ) 


This gives us E as a power series in z. 

We now assume that the pressure can be expanded in powers of z according to 

p = kr f; b J z i 

7=1 


( 12 - 13 ) 
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where we wish now to determine the unknown coefficients bj in terms of the Z N of 
Eq. (12-1 1). This can be done directly by substituting Eq. (12-13) into S = exp (pV/kT), 
expanding the exponential, collecting like powers of z, equating the coefficients to 
those of Eq. (12-12), and finally solving for the bj in terms of the Z N . The result of this 
straightforward algebra is (see Problem 12-5) 

b t = (\\vy'z t = i 
b 2 = {2\V)~\Z 2 -Z 2 ) 

*3 = (3!P)- 1 (Z 3 -3Z 2 Z 1 + 2Z 1 3 ) 

= (4! V)~ l (Z A - 4Z 3 Zj - 3Z 2 2 + 12Z 2 Z a 2 - 6 Z A 4 ) 

( 12 - 14 ) 


It is possible to write down a general formula for b } in terms of Z’s, but we shall not 
need it. We now know the bj in Eq. (12-13) in terms of the Z N . Notice that the calcu- 
lation of b 2 , for example, involves the calculation of only Z 2 and Z u that is, essentially 
the partition functions of two and one particle, respectively. Similarly, b 3 involves the 
determination of a partition function for three particles at the most. Thus we see that 
we have reduced the original N-body problem to a series of few-body problems. This 
was accomplished by using the grand partition function. 

We are not finished yet, however, since we really want an expansion of the pressure 
in terms of the density p and not some activity z. But we not only have the pressure 
in terms of z, we also have the density since the pressure and density are connected by 
In S through Eqs. (12-8) and (12-9). Thus we can write 


N _ k /d In E\ 

v~v\ 8k ) VtT 


2 

kT 



z 

V 



from which we have 


P = £ jbjZ 1 ( 12 - 15 ) 

j= l 

Now we have both p and p as power series in the activity z. The problem is to eliminate 
z between the two equations. There is a general mathematical technique to accomplish 
this (by means of complex variable theory), but we can determine the first few terms 
in a direct algebraic way (cf. Section 10-1). We simply write 


z = a t p + a 2 p 2 + a 3 p 3 + • • • 


Substitute this into Eq. (12-15) and equate like powers of p on the two sides of the 
equation to get 

a t = 1 

a 2 = —2b 2 

a 3 = — 3Z> 3 + %b 2 2 
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Now that we have z as a power series in p. We substitute this into the equation for p 
as a power series in z [Eq. (12-13)] to get the desired final result, namely,/? as a power 
series in the density: 

-j^=p + B 2 (T)p 2 + B 3 (T)p 3 +-- 
where 

B 2 (T) = -b 2 = -(2! V)~\Z 2 - Z 2 ) 

B 3 (T) = 4b 2 2 - 2b 3 

= - 3^5 W(Z 3 - 3Z 2 Z, + 2Z, 3 ) - 3(Z 2 - Z 2 f\ 


( 12 - 16 ) 


( 1 2 - 17 ) 


The equations become increasingly complicated as one goes to higher virial coefficients, 
but Table 12-1 shows that the first few virial coefficients suffice to account for p-V~T 
data up to hundreds of atmospheres. 

In the next section we shall apply these general equations to the important case of a 
classical monatomic gas. Before doing this, however, several comments are in order. 
We have expressed the virial coefficients in terms of the bf s which, in turn, are given 
in terms of the Z N . The b/s then appear to be simply intermediate quantities with no 
physical significance. They do have an interesting physical significance, however, that 
is lost in the grand canonical ensemble derivation presented here. The first statistical 
mechanical development of the virial expansion is due to Mayer, who used the 
canonical ensemble as his starting point. In this approach, the b/s turn out to be related 
in a way to clusters of j molecules and play a more central role in the development 
of the final equations. (See Mayer and Mayer in “Additional Reading.”) 

Another point involves the expansion of p or In E in powers of z. We have tacitly 
assumed that such an expansion exists in the first place. This assumption is correct 
for all the cases we shall discuss in this book, but it is not obvious that it is generally 
valid. It is possible that special cases of highly degenerate or strongly interacting 
systems do not allow such a power series expansion. For example, we shall see that 
the intermolecular potential between a pair of particles must go to zero more rapidly 
than r ~ 3 in order that the second virial coefficient be finite. (See Problem 12-6.) 
Thus we see that a virial expansion for a fully ionized gas, that is, a plasma, is invalid. 

In summary then, if the virial expansion exists, we have a method for calculating 
the virial coefficients from partition functions. Furthermore, this series can be shown 
to converge in some nonzero region for a large class of intermolecular potentials. 
At one time there was much effort to relate this radius of convergence to the point at 
which condensation sets in, but it is now felt unlikely that there is any such relation. 


12-2 VIRIAL COEFFICIENTS IN THE CLASSICAL LIMIT 

For simplicity we shall consider only monatomic gases in this section. The extension 
to include molecules with internal degrees of freedom is straightforward,, but the 
equations are more complicated. (See Problems 12-29 through 12-32.) In the classical 
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limit, Q(N, V, T) is given by Eq. (12-2). In particular, for N = 1, in the absence of 
external forces we have U = 0, and so we find that 


Qi(V, T ) = 



V 

A 5 


( 12 - 18 ) 


For N > 1 we can still integrate over the momenta as we did to get Eq. (12-3), giving 
Ul, ~N\ A 3N 


where Z N is the configuration integral given by Eq. (12-4). But according to Eq. (12-1 8), 
we can write this as 



( 12 - 19 ) 


Comparing this to Eq. (12-11), we see that the quantity Z N defined there is just 
the configuration integral. This, of course, is why we used the symbol Z N for both 
quantities. 

In order to calculate the second and third virial coefficients then, we need 

z^Ja^v 

Z 2 = jj e~ Ul,kT dr t dr 2 

and 

Z 3 = J J J e ~ u * lkT di x dv 2 dr 3 (12-22) 

To calculate the second virial coefficient, we need U 2 . For monatomic particles it is 
reasonable to assume that U 2 (r u r 2 ) depends only upon the separation of the two 
particles, so that U 2 = u(r X2 ) 9 where r X2 = |r 2 — rj. The intermolecular potential 
between two particles can be calculated, at least in principle, from quantum mechanics. 
We shall discuss this in Section 12-3. Figure 12-1 (a) shows a typical intermolecular 
potential between two spherically symmetric molecules. We can derive an equation 
for B 2 (T) in terms of u(r l2 ) by substituting Z 2 and Z x into Eq. (12-16) to get 

B 2 (T)=-~(Z 2 -Z l 2 ) 


( 12 - 20 ) 

( 12 - 21 ) 


= - 2 ^ JJ [e~ pui,l2) -\\dx t dr 2 (1 2-23) 

For neutral molecules in their ground electronic states, u(r l2 ) goes to zero fairly 
rapidly, say in a few molecular diameters, and so the integrand in B 2 is zero unless the 
volume elements dr x and dr 2 are near each other. Thus it is possible to change the 
variables of integration to r x and the relative coordinates r i2 = r 2 — r t and write 

B 2 (T) = - ^7 J*1 j - 1] *1 2 (1 2-24) 

Now the integration over the relative separation of the two particles is independent 
of where the pair is in the volume K, except for when the pair of particles is near the 
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Figure 12-1 . (a) A typical fntermolecular potential w(r) and (b) a Mayer /-function e fiv{r) — 1 plotted 
versus the intermolecular separation r. 


walls of the container. But thermodynamically we are interested only in the case that 
V -► oo, and so this “ surface effect ” becomes entirely negligible. Thus we can carry 
out the integration over dr t separately to give a factor of V and also write 4nr 2 dr for 
dr 12 to obtain the final result 


B 2 (T) = -In f [e~ Pu(r) - 1 ]r 2 dr 
J o 


( 12 - 25 ) 


This gives the second virial coefficient as a simple quadrature of u(r). We shall give a 
number of applications of this formula in Section 12-3. The limit of integration has 
been formally extended to infinity rather than just the walls of the container, since the 
integrand falls essentially to zero for r larger than a few molecular diameters. Note that 
B 2 is independent of V. 

The term in brackets in the integrand of Eq. (12-25) appears throughout the 
equations of imperfect gas theory and is commonly denoted by /(/*), 

fij =f( r u) = e~ u(r,j)lkT - 1 ( 1 2 - 26 ) 

Since u(r) -► 0 as r increases, we see that f{r) -+ 0 as r increases. The importance of the 
function f(r) was exploited by Mayer and is now called a Mayer /-function. Figure 
12— 1(b) shows a typical f(r) plotted versus r. 

To obtain the third virial coefficient, we need the potential U 3 (r u r 2 , r 3 ). Here is 
where the question of the pairwise additivity of intermolecular forces first arises. For 
many years it was common to assume that the intermolecular potential of a group of 
three molecules was the sum of the potentials of the three pairs taken one at a time, 
that is, to assume that 


U 3 (r i, r 2 , r 3 ) » u(r l2 ) + u(r 13 ) + u(r 23 ) 


( 12 - 27 ) 
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This apparently can be used as a good approximation, but eventually suffers from 
close scrutiny. In general, we should write 

U 3 = w(r 12 ) + u(r l3 ) + u(r 23 ) + A (r 12 , r 13 , r 23 ) 
or in an obvious notation which we shall use often 

U 3 = U 12 + U 13 + W 23 + ^3 ( 12 - 28 ) 

where A represents the deviation from pairwise additivity. We shall neglect A in 
what we do below, but Problem 12-26 involves the generalization to include a non- 
additive contribution to U 3 . We shall now derive an expression for B 3 (T) under the 
assumption (apparently mild) of pairwise additivity. This derivation involves manipu- 
lations that may at first be unfamiliar, but are actually quite simple and, furthermore, 
are common in the statistical mechanical theories of gases and liquids. 

In order to calculate B 3 , we must calculate b 3 , which is given by 6Vb 3 = Z 3 
— 3Z 2 Z a + 2Zi 3 [cf. Eq. (12-14)]. For Z 3 we have 

+ /12XI +/ 13 )(1 +/ 23 ) *1 * 2 *3 


-JJJVu / 13/23 + /12/13 +/12/23 +/13/23 +/12 +/13 +A3 + 1]*1 di 2 dr 3 

(12-29) 

The next step is to subtract 3Z 2 Zj from this. Since Z x = V, we write 

Z,Z 2 = V jj(f 12 + 1) dr t dr 2 = JJj (/ 12 + 1) dr 2 dr 2 dr 3 (1 2-30) 

where we have taken the volume in under the integral sign in order to obtain 
di x di 2 di 3 as in Z 3 above. Rather than to just subtract three times Eq. (12-30) from 
Z 3 , however, we recognize that Z t Z 2 can also be written as 

Z t Z 2 = V JJ(/i3 + 1) dt 2 dx 3 = JJJ (/ 13 + 1) dr t dr 2 di 3 

or equivalently 

Z X Z 2 = vfj (f 23 + 1) dx 2 dr 3 = JJJ (/ 23 + 1) dr, dr 2 dr 3 

So instead of subtracting three times any one of these expressions for Z X Z 2 from 
Z 3 , we subtract each once and get 

Z3 — 3 Z x Z 2 = JJJ [/12/23/13 +/12/13 +/12/23 +/13/23 ” 2 ] di x dr 2 dt 3 
For the 2 Z x that must be added to this to get 6Vb 3 , we add 

2 JJJ dr, dr 2 dr 3 

to get 

6Vb 3 = JJJ [/12/13/23 +/12/13 + f iif 23 +/13/23] dt 1 dr 2 di 3 


(12-31) 
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Recall now that B 3 (T) = 4b, 2 — 2b, , or more conveniently (in order to keep the 
factor of 6V with b 3 ) 

B 3 (T)=-±(6Vb 3 -12Vb 2 2 ) 

The term 6 Vb 3 is given by Eq. (12-31), and so we need only write \2Vb 2 2 in some 
convenient form and subtract it from 6 Vb 3 . Since 



we can write 

4b 2 2 = |jVi2 ^12^ — fn dr l2 ^ |j/i3 ^ r i 3 J 


or 


4 yb 2 2 — jdi x J/12 dr i2 J/13 di X3 


= JJJ/12/13 *1 *2 *3 

where the last line is just the inverse of the procedure of converting to relative coordi- 
nates. Clearly the subscripts on the /-functions are arbitrary, and we can readily 
derive two alternative expressions for 4 Vb 2 2 > one with / 13 / 23 and one with / 12 / 23 in 
the integrands. The three of these make 12 Vb 2 2 , the quantity to be subtracted from 
6Vb 3 to give B 3 . Thus we have finally 

= “ yp JJJ/12/13/23 di 2 di 3 ( 12 - 32 ) 

Let us examine the integrand of B 3 more closely. It involves three particles, and 
since f u -► 0 as particles i and j are separated, the product /12/13/23 vanish unless 

all the three particles are simultaneously close to one another. We can represent the 
integrand of B 3 pictorially in the following way. We draw a numbered circle for each 
different subscript appearing in the product and a line between each pair of particles 
connected by an /-function. For example, the integrands in the second and third 
virial coefficients can be represented schematically as shown in Fig. l2-2(a). Thus we 
see that all three particles in the integrand of B 3 are connected by /-functions. Since 
this represents a cluster of particles, we call diagrams as in Fig. 12-2 cluster diagrams. 
The only other cluster diagrams for three particles are shown in Fig. 12-2(b). 

There is a general result of imperfect gas theory whose proof is too complicated to 
present here, but can be readily appreciated in terms of cluster diagrams. This result 
states that the virial coefficients are given by 

( 12 ~ 33) 

where 

Pj = J\y j *1 dr 2 ■” *j+i 


( 12 - 34 ) 
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(a) 













Figure 12-2. Some examples of cluster diagrams, (a) The Integrands of the second and third virial 
coefficients, (b) The three topologically equivalent diagrams of three particles that are 
singly connected, (c) The three topologically different stars or doubly connected diagrams 
for four particles. Compare the degeneracies of these with Eq. (12-3S) and Table 12-2. 

where S[ t2 J+1 is the sum of all products of /-functions that connect molecules 

1, 2, . . . , j + 1 such that the clusters are connected in such a way that the removal of 
any point, together with all of the lines associated with that point, still results in a 
connected graph, that is, all the particles connected to one another. Notice that all the 
graphs appearing in Fig. 12-2(a) are connected in this way, while those in Fig. 12— 2(b) 
are not. Such diagrams are also called doubly connected since each particle is con- 
nected to any other by two independent paths. Doubly connected diagrams are also 
called stars. 


All the stars of up to seven particles are listed in Appendix 1 of an article by Hoover 
and DeRocco.* Table 12-2 lists the number of topologically distinct connected and 

Table 1 2-2. The number of topologically different connected graphs C(n) and star graphs S(n) for n <1 


n: 

2 

3 

4 

5 

6 

7 

C(n): 

1 

2 

6 

21 

112 

853 

S{n): 

1 

1 

3 

10 

56 

468 


Source : W. G. Hoover and A. G. DeRocco, J. Chem. Phys., 36, p. 3141, 1962. 


* W. G. Hoover and A. G. DeRocco, J. Chem . Phys., 36, p. 3141, 1962. 
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Figure 12-3. The graphs of p points and k lines of up to five particles, n is the number of topologically 
equivalent graphs of that type. (From G. E. Uhlenbeck and G. W. Ford, in Studies in 
Statistical Mechanics ed. by J. DeBoer and G. E. Uhlenbeck. New York: North- 
Holland Publishing Co., 1962.) 
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star graphs for n < 7. For example, for n = 3 there are two topologically different 
connected graphs, namely, L and A, and only one star graph, namely, A. 
Figure 12-2(c) shows the three different types of star diagrams that occur for n = 4. 
The three connected diagrams that are not stars for n = 4 are of the form LI, , 
and D . Note that clusters like I/I and IX are really equivalent (topologically) to 
u. Figure 12-3 gives all the distinct connected graphs and star graphs of up to five 
particles. Note how Fig. 12-3 is consistent with Table 12-2. 

In summary, then, the virial coefficients are integrals over sums of stars, and for 
n = 2, 3, and 4 these sums are given by 


*Sl, 2f 3.4 = 30 + 60 + E (12-35) 

The coefficients in 2>3>4 represent the number of equivalent stars of that form 
[see Fig. 12-2(c)]. We shall now discuss the calculation of B 2 (T) and B 3 (T) for a 
number of intermolecular potential functions. 

12-3 SECOND VIRIAL COEFFICIENT 

Equation (12-25) shows that once the intermolecular potential u(r) is known, the 
second virial coefficient can be calculated as a function of temperature. Second virial 
coefficients can be measured experimentally over a large temperature range to within 
a few percent. Figure 12-4 shows some experimental second virial coefficients versus 



Figure 12-4. The solid line is the reduced virial coefficient for the Lennard-Jones 6-12 potential as a 
function of the reduced temperature T*. Experimental data of a number of substances are 
also given. (From J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular 
Theory of Gases and Liquids. New York: Wiley, 1954.) 
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temperature. It is the goal of this section to understand this curve from a theoretical 
point of view. 

In principle, u(r) can be obtained from quantum mechanics, but this is a very 
difficult numerical problem and has not been done for anything more complicated 
than H 2 .* One can show from perturbation theory, however, that asymptotically 

u(r) -> -C 6 r“ 6 (1 2—36) 


Usually one uses simple analytical expressions with adjustable parameters for u(r ) 
that go asymptotically as r~ 6 . These adjustable parameters can then be varied to 
fit experimental data. Perhaps the most well-used form is 


u(r) = 


ne 

n — 6 


6/(i i-6) 


{©*-01 


(12-37) 


where a is the distance at which u(r) = 0, and e is the depth of the well. The exponent 
n is usually taken to be an integer between 9 and 15, but for historical reasons, 12 is 
still the most popular value. The r~ 6 is included in Eq. (12-37) so that u{r) has a 
correct asymptotic form. For n = 12, u(r) is called the Lennard-Jones 6-12 potential: 


u(r) = 4e 



(12-38) 


When plotted versus r, this potential is similar to that in Fig. 12-1 (a). There are many 
other potentials in use nowadays, but they are all similar to this. We shall discuss some 
of these shortly. 

It is not possible to integrate B 2 (T) analytically if Eq. (12-38) or any other realistic 
potential is used, and so before discussing this, however, let us consider some simpler 
but less realistic forms that have the advantage of allowing B 2 (T) to be integrated 
analytically. 


A. HARD-SPHERE POTENTIAL 

The hard-sphere potential has the form 



r < a 
r>c 


(12-39) 


This potential has no attractive part, but does simulate the steep repulsive part of 
realistic potentials. This is the simplest potential used and is the only potential for 
which the first seven virial coefficients have been calculated. (See, however. Problem 
12-39.) It is the potential often used by theorists to try to understand things in a general 
way (we shall come back to this later). A system of particles with this potential is 
called a hard-sphere gas or fluid. With the hard-sphere potential, B 2 (T) is 

B 2 (T) = - J f (~)4nr 2 dr= (1 2-40) 


Note that this is four times the volume of a sphere and also is independent of tempera- 
ture. 


* W. Kolos and L. Wolniewicz, J. Chem. Phys., 41, p. 3663, 1964. 



SECOND V1R1AL COEFFICIENT 235 


B. SQUARE-WELL POTENTIAL 

An extension of the hard-sphere potential that includes an attractive term and yet 
is simple enough to handle analytically is the square-well potential: 

oo r <c 

u(r) = —e a <r < hs (1 2-41 ) 

0 r > hj 


2, the range of the attractive well, is usually taken to be between 1.5 and 2.0. If this 
potential is substituted into Eq. (12-25), one gets 

B 2 CO = b 0 {\ - (P - - 1)} (12-42) 


The quantity b 0 is the hard-sphere second virial coefficient 27rcr 3 /3. Note that as 2 -► 1 
or e 0, Eq. (12-42) reduces to the hard-sphere result. Equation (12-42) can be used 
to fit experimental data very well, at least at ordinary temperatures. It does not pass 
through a maximum, however. (Why not?) Figure 12-5 shows the calculated and 
experimental second virial coefficient for argon, and Table 12-3 gives the square-well 
parameters for a number of molecules. 


B 2 {T), (cm 3 /mole) 



Figure 12-5. Second virial coefficients for argon calculated for several molecular models. The potential 
functions obtained from the experimental B 2 {T) data are also shown. (The experimental 
data are those of L. Holbom and J. Otto, Z Physik , 33, 1, 1925, and A. Michels, Hub. 
Wijker, and Hk. Wijker, Physica , 15 627, 1949, from J. O. Hirschfelder, C. F. Curtiss, 
and R. B. Bird, Molecular Theory of Gases and Liquids , New York: Wiley 1954.) 
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Table 12-3. Potential parameters determined from second virial coefficient data 


substance 

potential 

A 

ct(A) 

e/kC K) 

argon 

sw 

1.70 

3.067 

93.3 


LJ 


3.504 

117.7 

krypton 

sw 

1.68 

3.278 

136.5 


LJ 


3.827 

164.0 

methane 

sw 

1.60 

3.355 

142.5 


LJ 


3.783 

148.9 

xenon 

sw 

1.64 

3.593 

198.5 


LJ 


4.099 

222.3 

tetrafluoromethane 

sw 

1.48 

4.103 

191.1 


LJ 


4.744 

151.5 

neopentane 

sw 

1.45 

5.422 

382.6 


LJ 


7.445 

232.5 

nitrogen 

sw 

1.58 

3.277 

95.2 


LJ 


3.745 

95.2 

carbon dioxide 

sw 

1.44 

3.571 

283.6 


LJ 


4.328 

198.2 

w-pentane 

sw 

1.36 

4.668 

612.3 


LJ 


8.497 

219.5 

benzene 

sw 

1.38 

4.830 

620.4 


LJ 


8.569 

242.7 


Source : A. E. Sherwood and J. M. Prausnitz, /. Cftem. Phys. t 41 , p. 429, 1964. 


C. LENNARD-JONES POTENTIAL* 

The hard-sphere and square-well potentials are simple enough to yield analytic 
expressions for virial coefficients, but presumably do not represent an actual inter- 
molecular potential function. As we mentioned above, the Lennard-Jones potential 
[Eq. (12-38)] is the most commonly used form that does qualitatively represent the 
behavior in Fig. 12-1 (a). The second virial coefficient for this potential is 


Bs(r) - - s C M - w [GT ■(;)])■ i ] 4 " r! dr 


(12—43) 


This must be evaluated numerically. Before doing this, however, it is convenient to 
write Eq. (12-43) in a reduced form by defining a reduced distance, x = r/c, and a 
reduced temperature T* = kT/e. If we define a reduced second virial coefficient 
B 2 * = B 2 ! b 09 then 


B 2 *(T*) = 




(12-44) 


The reduced second virial coefficient B 2 * is a well-tabulated function. (See Hirschfelder, 
Curtiss, and Bird in 44 Additional Reading.”) 

There are a number of procedures for determining the “best” values of o and e 
from experimental second virial coefficient data. One way is to choose any two 
temperatures, say and T 2 , and calculate the ratio 


k B — 


B 2 {T 2 ) 


kxptl 


and set this equal to 


B 2 *(kT 2 le ) 
B 2 *(kTjE) 


Section 3-6 of Hirschfelder, Curtiss, and Bird (in “Additional Reading**) has an excellent discussion of 
virial coefficients and the Lennard-Jones potential. 
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Then hunt for an e by trial and error in the table of B 2 *(T*) until the calculated ratio 
agrees with the experimental ratio. This gives e. Then calculate o from 

B 2 (T l ) _ 2no 3 

B 2 *{T*) 0 3 

Hopefully, a and e will be independent of the choice of T x and T 2 and would be if the 
Lennard-Jones potential were really the “exact” potential. Table 12-3 gives the 
Lennard- Jones parameters for a number of molecules, and Fig. 12-4 shows the reduced 
second virial coefficient as a function of T* for several gases. Figure 12-5 shows B 2 
for the Lennard-Jones potential and the square-well potential. Notice that the two 
curves for B 2 are almost indistinguishable, although the insert shows that the two 
potentials themselves are quite different. It has often been said that the second virial 
coefficient is extremely insensitive to the potential used, but there is some indication 
that, if taken over a sufficiently large temperature range, it can be used to select 
potential functions.* 


D. OTHER POTENTIALS 

There are a number of other realistic potentials besides the Lennard-Jones, and B 2 
has been tabulated and used for all of them. Fitts’ review article! discusses several of 
these and their comparison with experimental data. One of the most successful of 
these simply includes anr 8 term in the potential given in Eq. (12-37) to give 



( 1 2 - 45 ) 


In terms of the usual parameters of e, the depth of the potential well, and of o, the 
distance at which the potential equals zero, u(r ) becomes 


u(r) (6 + 2 y) (do\ n [n - y(n - 8)] (do\ 6 MA 8 

e n — 6 \ r / n — 6 \ r / \r/ 


( 12 - 46 ) 


where d = rjo and y = C 8i er m 8 and r m is the distance at which the potential is a 
minimum. Since this potential has two more parameters than the Lennard-Jones 
6-12 potential, it is able to give much better agreement with experimental data. 
Nevertheless, it has been subjected to a fairly critical test and has performed quite 
well.! Mason and Spurling discuss a large number of other intermolecular potentials. 
(See Mason and Spurling under “ Additional Reading.”) 


12-4 THIRD VIRIAL COEFFICIENT 

The third virial coefficient can also be measured experimentally, although not as 
accurately as the second virial coefficient. Equation (12-32) for B 3 (T ) is 

Bs(T) = — — JJJ/12/13/23 d T i ^ r 3 

v 

= ~ 2 Jjfl2 fl 3 f 23 ^ r l3 ( 12 - 47 ) 

We shall show how to calculate B 3 analytically for the hard-sphere and square-well 
potentials in the next section. For the Lennard-Jones potential, we can measure all 

* A. E. Kingston, /. Chem. Phys., 42, p. 719, 1965. 

t D. D. Fitts, Ann. Rev. Phys. Chem., 17, p. 59, 1966. 

t H. J. M. Hanley and M. Klein, /. Phys. Chem., 76, p. 1743, 1972. 
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distances in Eq. (12-47) in terms of o and define a reduced temperature T* 9 and so 
on, to give 

B 3 *(T*) = ^ JJ / 12 / 13 / 23 dr 12 * dr l3 * ( 12 - 48 ) 

which, of course, must be evaluated numerically. This gives B 3 /b 0 2 as a universal 
function of kT/e 9 and this is shown as a function of T* in Fig. 12-6. Although the 
figure caption suggests reasons for the discrepancy between the experimental points 
and the theoretical curve, the principal reason is probably due to the nonadditivity of 
the three-body potential. 

Third virial coefficients, including a nonadditive contribution, have been calculated 
for several other potentials by Sherwood and Prausnitz.* They use the square-well, 
Kihara, and exp-6 potentials and make a very thorough comparison to experimental 
data. They find the inclusion of the nonadditive part of U 3 (r u r 2 , r 3 ) gives much 
better agreement with experimental data. This is shown in Fig. 12-7. Their article 
provides a readable and extensive study of virial coefficients and intermolecular 
potentials. 

It becomes increasingly difficult to either determine higher virial coefficients ex- 
perimentally or to calculate them theoretically for any realistic potential. A great 
deal of work has been done, however, in calculating higher virial coefficients for simple 
potentials such as the hard-sphere potential, not to compare such results to experi- 
ment, but to investigate the structure of the statistical mechanical equations themselves. 
For this reason we shall discuss some of the hard-sphere calculations of B 3 through 
B 7 . Most of the results in this area are due to Ree and Hoover.| 



Figure 12-6. The reduced third virial coefficient for the Lennard-Jones potential. The experimental 
valnes have been reduced using values of £ and o determined by fitting the second virial 
coefficient to experimental data. The nonspherical molecules (carbon dioxide and ethylene) 
deviate markedly from the calculated curve. Also the light gases (hydrogen, deuterium 
and helium) exhibit different behavior because of quantum effects. (From R. B. Bird, 
E. L. Spotz, and J. O. Hirschfelder, J. Chem . Phys. 18, 1395, 1950.) 

* A. E. Sherwood and J. M. Prausnitz, J. Chem. Phys., 41, p. 429, 1964. 
t F. H. Ree and W. G. Hoover, J. Chem . Phys., 40, p. 939, 1964. 
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Figure 12-7. Comparison of observed and calculated values of the third virial coefficient of argon. 

Solid lines include a nonadditivity correction; dashed lines show a portion of the additive 
third virial coefficient curve. (From Sherwood and Prausnitz, J. Chem. Phys. 41, 429, 1964.) 



12-5 HIGHER VIRIAL COEFFICIENTS 

FOR THE HARD-SPHERE POTENTIAL 


We have already seen that B 2 = 2no z l3 for the hard-sphere potential. The third 
virial coefficient can be calculated by a geometrical consideration of the overlapping 
of three spheres, but this is a laborious and demanding route. We shall calculate B 3 
here by a method due to Katsura* using Fourier transforms. Let = r j — r x in 
Eq. (12-32). Then 

^ 1 P 2 1 )•/*( 1 03 1 )/( 1 P3 “ P2 I ) ^P2 dp 3 

(1 2-49) 

Now let the Fourier transform of/(|p|) be (cf. Appendix B) 


y(0 = y(| t|) = (2t0" 3/2 J/(| dp 

(12-50) 


(12-51) 

In Eq. (12-51), p is the scalar magnitude of p. The second expression for y(t) is obtained 
by calling t the z-axis of a spherical coordinate system and measuring p relative to t. 
Then we can set t • p = tp cos 0, dp = p 2 dp sin 6 dO d<t> 9 and integrate over 6 and <f> 
(see Problem 12-41). Equation (12-50) can be inverted to give 

f(p) = (2n)- 3 l 2 jy(\t\)e i **dt 


(2\ m r” . . sin pt , 

-(;) J>> , * 

(12-52) 

For hard spheres. 


f(p)=- 1 0<P<1 

= 0 pel 

(12-53) 


S. Katsura, Phys. Rev., 115, p. 1417, 1959. 
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and so according to Eq. (12-51) 


y(0 


_ 3 / 2 V 12 f cosfrrt) 
W l (ot) 2 

3 J 3 iz(. a 0 


= — <7 


(ot) 312 


sin(ut) 

1 w 


where J 3/2 (0 is a Bessel function. 

The trick now is to realize that Eq. (12-52) shows that 


/( I Pa - P 2 1 ) = (2*) - 3/2 J M ‘ P1 dt 


( 12 - 54 ) 


( 12 - 55 ) 


and substitute this into Eq. (12-49) for B 3 to get 

(2 ny 3 ' 2 


b 3 =- 


3 

(2ji) 3/2 


|JJy(0/(P2)/(P3)e i ‘ p3 '* P2 ^p2 <*P 3 


Jdt y 3 (t) 

( 27 t ) 3 / 2 f » .2 [^ H ] 3 

-~l“ a J 0 4,t< ~(o(f I- dt 

f ^3/2W] 3 *" ! 
J 0 


3 •'o 

4n(2n) 3/2 G 6 -°° 


( 12 - 56 ) 


where we have converted the variable of integration from t to x = ot. 

The integral in Eq. (12-56) is not standard, but Katsura* has evaluated many 
integrals of the form 


r°° 

x x J a (x)J fi (x)J y (x) dx 
J o 


of which Eq. (12-56) is a special case. Using Katsura’s paper, Eq. (12-56) becomes 
(see Problem 12-40) 


4n(2n) 3l2 a 6 5 _ 5 n 2 a 6 _ 5 2 

3 ’ 48(2ti) 1/2 ~ 18 “ 8 *° 


( 12 - 57 ) 


where b 0 is the second hard-sphere virial coefficient. 

This is a technique which can be used in many applications involving retype 
integrands. The integration of the most highly connected stars such as H cannot be 
done in this way and are usually done numerically by Monte Carlo methods.! 

The fourth virial coefficient B 4 involves the sum of three types of cluster integrals 
and is given by 


B a = - — JJJJ {3D + 60 + H} dry dr 2 dr 3 dr. 


= (-0.9714 + 1.4167 - 0.1584)V 

= 0.2869 V ( 12 - 58 ) 


Ree and Hoover! have evaluated B 5 through B n by a variety of numerical methods. 


• S. Katsura, ibid. 

t G. W. Bern in Modern Mathematics for the Engineer, ed. by E. F. Bechenbach (New York: McGraw-Hill, 
1956). B. J. Alder and W. G. Hoover, in Physics of simple liquids, ed. by H. N. V. Temperly, J. S. 
Rowlinson, and G. S. Rushbrooke. Amsterdam: North-Holland Publishing Co., 1968. 

% F. H. Ree and W. G. Hoover, J. Chem. Phys ., 46, p. 4181, 1967. 



QUANTUM CORRECTIONS TO B 2 {T) 241 


The number of integrals becomes quite large, there being 468 in the calculation of 
B 7 . They find that 

=0.1097 + 0.0003 

^ =0.0386 ± 0.0004 

V 

Bn 

=0.0138 + 0.0004 (12-59) 

Not even the diagrams have ever been generated for B 8 . These calculations were made 
to study virial coefficient equations themselves and also to determine how much one 
can learn about the liquid state from the first few virial coefficients. They found, for 
instance, that using just the truncated seventh degree polynomial in the density, they 
obtained 10 percent agreement with certain “computer experiments” on hard-sphere 
liquids. (We shall discuss these “computer experiments” later.) A virial expansion 
cannot really be expected to describe a liquid though, since the series itself diverges 
at certain densities and temperatures. Much work has been done examining this series 
and its convergence. This involves quite a sophisticated mathematical discussion of 
the Pj for large j 9 particularly their volume dependence. One of the questions that at 
one time was investigated is whether the phenomenon of condensation occurs at the 
density and temperature at which the virial expansion diverges. It is presently believed 
that these two are not related to each other. 

Up to now we have concentrated on a one-component, classical, monatomic gas. 
It is straightforward to extend these results to two components (see Problems 12-17 
and 12-18) and to polyatomic molecules (see Problems 12-29 through 12-32.) The 
main difference in the polyatomic-molecule case is that the intermolecular potential 
now depends upon the relative orientation of the molecules. This is discussed in 
Sections 3-4 and 3-8 in Hirschfelder, Curtiss, and Bird (see “Additional Reading”) 
We can also extend these classical mechanical formulas to include quantum effects. 
To calculate the second virial coefficient in a complete quantum-mechanical scheme 
requires the detailed scattering state wave functions and phase shifts for w(r), and this 
is very difficult. Chapter 6 of Hirschfelder, Curtiss, and Bird discusses such calculations 
for several potentials, such as the square well and Lennard-Jones. It is possible, how- 
ever, to make an expansion in powers of h 9 and the first few terms not only give a 
measure of the degree of importance of quantum effects, but are also quite tractable. 
In the next section we calculate the first quantum correction to our classical B 2 (T). 

12-6 QUANTUM CORRECTIONS TO B 2 (T ) 

Recall Eq. (10-116) for Q N , namely, 

Qn = E J *1 ■ • ■ dr N 

In Section 10-7 we showed that Q N could be expanded in a power series in h, with the 
leading term being the classical partition function (without the N\ since we did not 
consider the symmetry requirements on the wave functions). We found that 

Qn = py J J ex P(-/W*)*KPi» ■ - - • . P) ■ ■ ■ dl N 
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where 


w(Pi, 


00 


. t n> P) = Z ft ' w j(Pi» ■>*n>P) 
1 = 0 


with 


H>0 = 1, 


W, 


ifi 


2 N 




and 




We can substitute this series into Q N and integrate over the momenta to get 

{InmkT?"' 2 ff h 2 p 2 » 

12 m k 


Q» - JJ j. - 1 (v, vf) + • • •) *, • • ■ *„ 


(12-60) 


The contribution from w x vanishes upon integration, since it is an odd function of 
the momenta. We now divide this by N\ and substitute it into Eq. (12-11) and then 
that into Eq. (12-16) to get 

B 2 =-b 2 = ^(Z 2 -Z t 2 ) 

= —2n f °V*“< r > - 1 ]r 2 dr + - ** - 3 f V*”® V dr + 0{h 3 ) 

J 0 2Anm(kTf J 0 \dr) ' 


(12-61) 

Table 12-4 shows the magnitude of the various quantum contributions to B 2 (T) for 
several gases at various temperatures. It lists not only the first correction which we 
have derived above, but also the second, which goes as Ji 4 , and an Ji 3 -term that arises 


Table 12-4. Contribution of various quantum terms to the second virial coefficient 


gas 

r(°K) 


/i 2 -term 

/?*-term 

ideal quantum gas 

He 4 

27.3 

4.87 

9.16 

-4.05 

0.50 


83.5 

8.87 

1.82 

-0.19 

0.093 


256.0 

11.13 

0.48 

-0.01 

0.017 

h 2 

49.2 

-47.1 

20.68 

8.63 

0.57 


182.8 

7.55 

2.26 

-0.19 

0.080 


592.0 

15.7 

0.49 

-0.01 

0.014 

d 2 

37.0 

78.94 

19.60 

-10.21 

0.31 


182.8 

7.55 

1.13 

-0.05 

0.029 


592.0 

15.7 

0.25 

0 

0.004 

Ne 

35.6 

-66.2 

3.80 

0.47 

0.03 


95.0 

-6.23 

0.55 

-0.01 

0.007 


392.0 

12.1 

0.07 

0 

0.0008 


Source : J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Gases and Liquids (New York: 
Wiley, 1954). 
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when the symmetry requirement of the wave functions is taken into account. This 
A 3 -term is just the quantum-mechanical ideal gas second virial coefficient, Eqs. (10-15) 
and (10-48). You can see from Table 12-4 that this term is not numerically important. 
Clearly the quantum corrections become less important as the temperature increases. 
Table 12-4 indicates that the expansion may not be used below about 40°K for helium 
and 75°K for hydrogen. Of course, it is in the case of light molecules like He and 
H 2 that quantum effects are most important. 


12-7 THE LAW OF CORRESPONDING STATES 

In Sections 12-3 and 12-4 we discussed the second and third virial coefficients for 
several intermolecular potentials. For the Lennard-Jones potential, for example, we 
found that the virial coefficients could be written in a reduced form such that B 2 (T)lb 0 
or B 3 (T)lb 0 2 is a function of only the reduced temperature T* = kT / e. Furthermore, 
assuming that all molecules interact through a Lennard-Jones potential (but with 
different parameters a and e), this function of T* would be the same for all systems. 
This is illustrated in Fig. 12-4 and is an example of the law of corresponding states. We 
have seen examples of this law previously in Chapter 11, where we discussed the 
Debye theory of crystals. In this section we shall discuss a more general version of the 
law of corresponding states for classical monatomic systems. 

We shall assume that the total intermolecular potential can be written in the form 


u = E u(r,j) = 
*. J 



( 12 - 62 ) 


In particular, we are assuming pairwise additivity and that the pair potential can be 
written as an energy parameter e times a function of only the reduced distance r o. 
This pair potential is quite general and need not be a Lennard-Jones potential. How- 
ever, we do assume that the pair potential <}> is the same function for all substances. 
With these assumptions, then, the configuration integral is 


= <t 3 "/(t*,|,n) 




where the function /is the same for all molecules. We have written V/o 3 as one of the 
variables of / since all distances have been reduced by a. 

The N dependence of / is restricted by the fact that the Helmholtz free energy A is 
an extensive thermodynamic quantity. To see this, we use the fact that since A is 
extensive, A/N is intensive and, consequently, must be a function of only v = V/N 
and T. In terms of Z N , A/N is 


A 

NkT 




-S 1 °| +3 "‘ A 


Zn 

Nl A 3JV 
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Since A/NkT is a function of v and T only, N 1 In (Z n *N\) is a function of v and T 
only, which says that 



= a 


3N 



(12-63) 


where g is the same function for all molecules (since / is). 
Thus the partition function is of the form 

_ r <r 3 g{kT t e 9 vla 3 y] N 

Q(N, V,T)= [ - ^3 - J 

The pressure is 

p - kT hr),r 

kT I d Ing \ 

Nc 3 \d(vlo 3 )) N ' T 

kT / d In g \ 

C7 3 ydiv/a 3 ))^ 

which shows that 


PV = l V \l dlng \ 

kT W/l^/Wr* 


(12-65) 


must be the same function of T* = kT;e and v a 3 for all substances. Table 12-5 shows 
the reduced critical parameters for a number of substances. 

A look at Table 12-5 shows that the reduced critical constants of Ne, Ar, Kr, Xe, 
N 2 , and CH 4 are quite similar, and hence these substances obey the law of corre- 
sponding states as we have formulated it. The other substances do not conform to it 
for various obvious reasons. For example, we assumed that classical statistics was 
applicable, and this is not valid for He at 5°K. We also assumed that the intermolecular 
interactions were spherically symmetric or at least effectively spherically symmetric. 


Table 12-5. The critical constants and reduced critical constants, reduced by means of the Lennard- 
Jones parameters 


gas 

7XK) 

tf e (cm 3 /mole) 

Pc(atm) 

T* 

VJNo 3 

PcV e lkT c 

He 

5.3 

57.8 

2.26 

0.52 

5.75 

0.300 

Ne 

44.5 

41.7 

25.9 

1.25 

3.33 

0.296 

Ar 

151 

75.3 

48.0 

1.28 

2.90 

0.292 

Kr 

209 

91.3 

54.3 

1.27 

2.71 

0.289 

Xe 

290 

118.7 

58.0 

1.30 

2.86 

0.289 

n 2 

126 

90.0 

33.5 

1.32 

2.84 

0.292 

co 2 

304 

94.0 

72.8 

1.53 

1.93 

0.274 

OHU 

191 

100.0 

45.8 

1.28 

3.07 

0.292 

n-pentane 

470 

310.3 

33.3 

2.14 

0.84 

0.268 

neopentane 

434 

302.5 

31.6 

1.87 

1.21 

0.268 

benzene 

563 

260.0 

48.6 

2.32 

0.68 

0.274 


Source'. J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Gases and Liquids (New York: 
Wiley, 1954). 
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Table 12-5 shows that this assumption is not good for molecules like pentane and 
benzene. Similar disagreement would have been observed if we had included polar 
molecules. The law of corresponding states can be extended to include polar sub- 
stances and quantum effects, but we shall not do so here.* 

Note that the law of corresponding states provides a useful method for estimating 
Lennard-Jones parameters. For example, for those substances that do obey a law of 
corresponding states, T c * is about 1.3, and vja 3 is 2.7. Thus we have 



*T C 


and 


m* 2n N° 3 , 

Nb 0 = — - — ^$v c 


( 12 - 66 ) 


12-8 CONCLUSION 

This almost concludes our discussion of gases. We have been able to derive many 
both useful and exact results (something which is quite rare) by appealing to the 
grand canonical ensemble. The area of imperfect gas theory is quite well understood. 

Before leaving this, however, let us go back to B 5 for hard spheres and look at its 
calculation a little more closely. B 5 is a sum of ten types of cluster integrals, whose 
values aref 

= J{12O+6O0+ 10a+60®+30&+ 10^ 

+ 15®+30^ + 10#+#}rfr 1 ---rfr 5 (12-67) 

= “ i (- 45 - 70 + 152 72 + 23 43 “ 1 14 28 “ 47 - 55 “ 20 - 55 

+ 17.15 + 39.93 - 9.17 + 0.73)V 
1 3 29 

= " 30 ( ~ 237 * 25 + 233 * 96 >V = ^ V = 0.11V 02-68) 

It would appear that if one must add ten difficult-to-calculate terms, each of the order 
of ten or a hundred, to get 0.11, something is wrong somewhere. Is it possible to 
calculate 0.11 directly? Maybe our whole approach to gas theory is the wrong way to 
go about things, in spite of its rigor and physical appeal. 

Along this same line, the determination of virial coefficients from experimental 
p-V-T data is not at all trivial. The usual method is to curve-fit the data to a poly- 
nomial, but it is well known that the coefficients depend upon the degree of the poly- 
nomial used. One usually uses a greater and greater degree polynomial and waits 
for the lower coefficients to settle down.} A much more satisfactory method is to 
expand p!kT not in a power series, but in terms of orthogonal polynomials in the 
density, since then the coefficients, are independent of the number of terms used. To 


* R. W. Hakala, J. Chem. Phys ., 71, p. 1880, 1967. K. S. Pitzer, J. Am. Chem. Soc., 77, p. 3427, 1955. 
t J. S. Rowlinson, Proc. Roy . Soc., A279, p. 147, 1964. 
t See, for example, K. R. Hall and F. B. Canfield, Physica , 33, p. 481, 1967. 
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see this, consider the set of orthonormal polynomials {<j> /p)} with weighting function 
w(p). 

Then 

and so 

c &T) = \{jp^4>j(pMp)dp 

which is independent of the other C/T ) and any truncation of the series for the 
pressure. Is it possible to find a {4>j{p)} and w(p) such that the Cj(T) can be written 
as integrals over physically appealing molecular aggregates and not involve the delicate 
cancellation that appears in the Mayer theory? 
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PROBLEMS 

12-1. The usual form of a virial expansion is 

pV B(T) C(D 
RT 1H " V V 2 ^ 

Some workers, however, prefer to express their data by expanding the compressibility factor 
in a power series in the pressure 

^=l+F(7>+C'(7> 2 + --- 

Find the relations between the two sets of virial coefficients. 

12-2. Find the second virial coefficient from the Dieterici equation: 


p(v — b)= kTe~ a,kTv 

12-3. Sketch the cluster diagrams corresponding to the following products of /-functions: 

(a) ft 2/2 3/34/4.5/5 i/i 4/25 

(b) /1 2/2 3/1 3/34-/4.5/4.6/56 
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12-4. Find the second virial coefficient and the Boyle temperature for the Berthelot 
equation of state 

/ N 2 A\ 

\p--p I fKv-m=N kT 

where A and B are constants. 

12-5. Derive Eqs. (12-14). 

12-6. Show that the intermolecular potential must vanish more rapidly than r“ 3 in order 
for B 2 {T) to exist. Do this by breaking up the integral in B 2 into two regions, say 0 to L and 
L to oo. Choose L large enough so that the exponential in B 2 can be expanded, and investigate 
this convergence. 

12-7. Show that the second virial coefficient for a Lennard-Jones potential can be written 
in the form 

bo j=o 


with 


b< J > = 


2 J+in 


r 



The quantity T(jc) is the gamma function. 

12-8. Show that the second virial coefficient for the Sutherland potential, defined by 


z/(r) = oo r <o 

= ~cr~ v r> a 


is 


BJJ) = 


2l TU 3 

~3~ 




j 


The parameter y is usually taken to be 6. 

12-9. Calculate the second virial coefficient for the triangle potential* 


w(r) = oo 


r<o 


= -75 o<r<Xo 

cr{A — l) 

= 0 r > Xu 

12-10. Show that 

1 r* du(r) 

B 2 = — r e -»<rWT 47rr 2 & 

6 kTJ 0 dr 

is equivalent to 

1 00 

B 2 = - - f (e- p “ (r) - VflTrr 2 dr 
State the condition on w(r) that is necessary. 

12-11. Find an exact B 2 for the potential u(r) = a/r H for all r with n > 3, using the results 
of Problem 12-10. Carry the integration to the point of having a well-known function in 
your result for B 2 . 


See M. J. Feinberg and A. G. DeRocco, J. Chem . Phys 41, p. 3439, 1964. 
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12-12. Using the tables in Hirschfelder, Curtiss, and Bird (see “Additional Reading”), 
plot the second virial coefficient of argon versus temperature from its boiling point, 84 K, to 
500 U K, assuming a Lennard-Jones potential with parameters given in Table 12-3. Compare 
your result to the experimental values. 

12-13. Using the data in F. Whalley and W. G. Schneider, J. Chem. Phys ., 23, p. 1644, 
1955; A. E. Kingston, /. Chem . Phys, 9 42, p. 719, 1965; B. E. F. Fender and G. D. Halsey, 
J. Chem . Phys. 9 36, p. 1881, 1962; and R. D. Weir, I. Wynn Jones, J. S. Rowlinson, and 
G. Saville, Trans . Far. Soc., 63, p. 1320, 1967, show that the second virial coefficient satisfies 
the law of corresponding states for argon, krypton, and xenon. 

12-14. Using the experimental data in any of the references in Problem 12-1 3, determine e 
and a for the Lennard-Jones potential. 

12-15. Calculate the compressibility factor for argon at 10 atm and 0 J C. 

12-16. Show that the virial expansion for the thermodynamic energy is 

E 3 - 1 dB J+1 , 

NkT 2 £i j dT 9 

and that for entropy is 


5 

Nk 


'ideal 


Nk j?i j 8T^ TBj *^ pJ 


12-17. Show that for a binary mixture. 


— pi 4- pz 4- B 2 o(T)pi z 4- Bn(T)pip 2 4- Bo 2 {T)p2 2 + * • • 

and derive expressions for the second virial coefficients. Show that if this virial expansion is 
written in the form 


-j^=P + B I (T)p* + -~ 
with p = pi 4 * pi , then 

B 2 {x u T) = Xl 2 B 2 0 (T) + jctU - x^B^iT) + (1 - xtfBoJT) 

where is the mole fraction of component 1. 

12-18. Show that for a multicomponent mixture, 

£t=p + Bip i + b 3 p 3 + -~ 

where p = pi + p 2 H h p„ and 

B 2 (T) = t t B,AT)x,xj 

i=i j = i 

B 3 ( T)= J ^ X BijkXiXjXk 

i=i j=i k= i 

where the x's are mole fractions. 

12-19. Using tables for the Lennard-Jones potential, calculate the volume change when 
5 liters of N 2 and 2 liters of Ar are mixed at 10 atm pressure and 300°K. 

12-20. Derive the first few terms of a virial expansion for the fugacity /. 

12-21. For an ideal gas, 

C p — Cy = R 

Derive a virial expansion for this difference for a real gas. 
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12-22. The Joule-Thomson coefficient fx is defined by 



Derive a density expansion for /x. At high temperatures fx is negative, and for sufficiently low 
temperatures, it is positive. The temperature at which fx is zero is called the inversion tempera- 
ture. Show that for not too dense gases, the inversion temperature is given by d(B 2 iT)idT — 0. 
12-23. Show that 



where fx 0 and C p ° are the zero pressure limits of the Joule-Thomson coefficient and C p . 

12-24. Using the result of Problem 1-35, derive an expression for the first nonideal 
correction to the speed of sound in a gas. 

12-25. Calculate the third virial coefficient for the potential* 

oo r <o 

u(r) = ■ — e u <r <2a 

0 r > 2a 


12-26. Derive an expression for the third virial coefficient without assuming that the 
intermolecular potential is pairwise additive. Let U 3 (ti 9 r 2 , r 3 ) be 

U 3 { r x , r 2 , r 3 ) = «(r 12 ) + w(r 13 ) + w(r 23 ) + A (n 2 , r 13 , r 23 ) 

12-27. Derive Eq. (12-51) from Eq. (12-50). 

12-28. Evaluate the ring contribution to B A by the method of Fourier transformations in 
Section 12-5. 

12-29. The Hamiltonian for a gas composed of N rigid rotors is 

H = ? i Pjl +[ ? h (*• : 1 ^ + + uiqi ’ ■ ' ■ ’ qw) 

where q j is the set of five coordinates x j9 y j9 zj,6 Jt and <f> j needed to specify the location and 
orientation of a molecule. Show that the second virial coefficient for such a system is given by 

Bl(T) = ~J2^vS! fl1 dqi dqt 

where 

dqj = dxi dy { dz t sin 6 t d6 t d<j> t 

12-30. The potential between two dipolar molecules (see Fig. 12-8) can be approximated by 
u(r 9 6 l9 0 2 , <f>i - <f>x) = oo r<u 

u 2 

= r g(fiu — r> o 

r 3 

where 

g{ 6 1 , 6 29 <f > i — <f> 2 ) = 2 cos 0 A cos 0 2 — sin sin d 2 cos (<f> 2 — <^i) 

Show that the second virial coefficient for this potential can be expressed as 


B 2 (T) = b 0 



See S. Katsura, Phys. Rev., 115, p. 1417, 1959. 
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Figure 12-8. The coordinates describing the mutual orientation of two polar molecules. (From Hirsch- 
felder, Curtiss, and Bird, Molecular Theory of Gases and Liquids. New York: Wiley, 
1954.) 


12-31. Show that 


„ r” \kT 1 7T 1/2 Vu 2 y i 

= U^ v ‘ r (— erf Tr) ~T dr 


for a potential 

<f>(r, 0 U 0 2 , </>i — <f> 2 ) = Ut(r) + w 2 (r)[cos 2 0i + cos 2 0 2 ] 

12-32. Consider the angle dependent potential 
«(r, cos 6) = « 0 (r)[l + aP m ( cos 0)] 

where PJt cos 0) is a Legendre polynomial of degree m, and w 0 (r) = — a/r". Find an expression 
for the angular correction factor /(a) in the equation 

B 2 [u(r 9 cos 0)] =f(g)b 2 [i4 0 (r)] 

Carry out the integration to obtain a “ simple” expression for f(a) for a “ dipole” interaction 
m = 1 and long-range dispersion forces n = 6. As a check of your result, show 

lim /(a) = 1 

a-* oo 

For a typical value of a ~ t, calculate /(a). 

12-33. Derive the quantum correction to B 2 {T) up through the term in h 2 ; that is, derive 
Eq. (12-61). 

12-34. The hard-sphere virial equation can be written in terms of the variable* 

f— 7W 0 bo 

^ ^ 6v 4v 

where v 0 is the closest packing volume, and b 0 is the hard-sphere second virial coefficient. 
In terms of y 

-g r =l+4y+10>> 2 + 18.36y 3 + 28.2 y* + 39.5y 5 + • • • 

We can approximate the terms in this expansion by 
vv 

= 1+4 y+ IQ y 2 + 18,y 3 + 28,y 4 + 40y 5 H 

kT 

Now if you are clever enough to notice that B n is given by n 2 + n — 2, you can write 

-f~ = 1 + £ (n x + n — 2)y"~ I 
kT " = 2 


♦ See N. F. Carnahan and K. E. Starling, J. Chem. phys. 9 51, p. 635, 1969. 
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Sum this series exactly, and compare with the result of Thiele, J. Chem . Phys ., 39, p. 474, 1963, 
as well as the exact molecular dynamics calculation of Alder and Wainwright, J. Chem . Phys., 
33, p. 1439, 1960. In order to make this comparison, calculate the compressibility factor for 
vfv o = 1.5, 2.0, 10.0. The Alder and Wainwright values are 12.5, 5.89, 1.36, respectively. 
Furthermore, calculate the seventh hard-sphere virial coefficient from this scheme and com- 
pare to the result 0.0138 ± 0.00046 o 6 calculated from cluster integrals. Would you expect a 
phase transition for this infinite series virial expansion? Why or why not? 

12-35. The virial expansion as originally developed by Ursell* was quite different from the 
method presented in this chapter. Consider the Boltzmann factor fV N (y u r 2 , ..., r*), which 
appears in the configuration integral. Ursell expressed W N as a sum of products of functions 
Ui defined by 


U l (r l )=W 1 (r l ) 

U 2 (r t , Tj) = W 2 (yi , rj) - W^dW&j) 

U 3 (r t , r j , r k ) = W 3 (r t , r j , r k ) - W 2 (r t tj) , ^(r*) - W 2 {tj , r k )^i(ri) 

- W 2 {y k , r t ) + 2 Wi(rj) Wi(r k ) 

Show that these (/-functions, now called Ursell functions, are short-ranged functions in the 
sense that they vanish unless all of the / molecules in the argument of (/, are at least singly 
connected. 

Invert these equations to write 


^i(ri) = C/i(r t ) = l 
tV 2 (r tf r,) = U 2 (r if rj) + UMU&d 
fV 3 (r t , r Jf r k ) = U 3 (r t9 r,, r k ) + U 2 (r l9 r/)(7i(r*) 

+ U 2 (r j 9 r k )U 1 (r l )+ U^rdU^j)^ U&dU&dUM 

This expresses the integrand of the configuration integral into a sum of products of short- 
ranged functions. This method can be used to write any property of A-molecules in terms of 
short-ranged functions (c/. Problem 12-36). It is easy to show (although we shall not) that 
the bj of Section 12-1 are given by 

bj = (F/!)- 1 J • • • J Ufa, r, ..... r dtj 

and that the configuration integral can be written as 
N\ imj ) j = i m s 

where the asterisk denotes the condition 

Hmt = n 

The Ursell development has the advantage of not being limited to pair-wise additive potentials. 

12-36. Consider the polarizability of a system of N polarizable molecules a*(ri, . . . , r*). 
In general, this quantity cannot be rigorously written as the sum of N isolated-molecule 
polarizabilities, but can be expressed exactly by 

N 

“n(Fi In) = Y. a t( r i) + EE aiXri.ry) 

1=1 1 XKjXN 

+ EEE “uk(r.,rj,r*)H 

1 Xt<J<k*N 

* See Section 3-2 of Hirschfelder, Curtiss, and Bird in ‘‘Additional Reading.” 
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Show that the a { are polarizabilities of isolated molecules; the a tJ are incremental polariz- 
abilities of pairs of molecules, and so on, or, in other words, by inversion determine the a t , 
a u> a u* # and so on, in terms of the polarizabilities of groups of molecules.* 

12-37, Find the set {mj} which maximizes In Z n /N !, where 

y n {vbj)mJ 
N\ <~, M mj\ 

where the asterisk signifies £ jmj = N as a constraint. Call the undetermined multiplier z. 
The m s * (those m s that yield the maximum term) will come out in terms of z. So will Z n /N \ , 
and hence Q(N, V, T) and p. Solve the equation 

Z jntj* = N 

J = 1 


for z (at least the first few terms of z as a power series in p) and derive the first few terms of 
the virial expansion for p.f 

12-38. Find the first few terms of Z n /N\ 9 written in the form (see Problem 12-37) 

( fli a 2 Qn\ 

1 +y+yi + -+yt) 

expand the logarithm and use 


P 



N 


to find the virial expansion. 


12-39. A model “ potential ” that has been used in the theoretical study of imperfect gases 
is the so-called Gaussian gas, in which it is assumed that the Mayer /-function f(r ) can be 
approximated by a negative Gaussian, that is, by — e~ ar2 , where a is a constant. Show that 
this “ potential ” simulates a soft repulsive potential. The great advantage of this “ potential ” 
is that the many-center integrals involved in the evaluation of the B„ can be done analytically, 
since it can be readily shown by matrix algebra** that 


r 00 r (27 tY /2 

J j dxi dXz dXn !,»'• 


where t = (fi, t 2 , . . . , t n \ x = (x u x 2 , . . . , *„), A is a matrix, and where \A | is the determinant 
of A. Use this to show that for a Gaussian gas, 



B 3 = 0.2576 2 
B*= -0.125Z7 3 


and with patience 

B 5 =0.013M 


* See H. B. Levine and D. A. McQuarrie, J. Chem . Phys., 49, p. 4181, 1968. 
t See Chapter 13 of Mayer and Mayer in “Additional Reading.” 
t See T. L. Hill, J. Chem. Phys., 28, p. 61, 1958. 

** See H. Cramer, Mathematical Methods of Statistics (Princeton, N.J.: Princeton University Press, 1946). 
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The sixth and seventh virial coefficients are also known for this potential, B 6 — 0.03 Sb 5 and 
B n = — 0.0306 6 .* 

12-40. Go to the tables in Katsura’s papert to verify the result for B 3 for hard spheres 
given in Eq. (12-57). 

12-41. Derive Eq. (12 52). 


* See Studies in Statistical Mechanics, vol. 1, ed. by J. DeBoer and G. E. Uhlenbeck (Amsterdam: North- 
Holland Publishing Co., 1962, p. 182). 
t In Phys. Rev., 115, p. 1417, 1959. 
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DISTRIBUTION 
FUNCTIONS 
IN CLASSICAL 
MONATOMIC LIQUIDS 


In the last chapter we have shown how one can derive a rigorous expansion of the 
pressure in terms of the density and thus reduce the many-body problem of a non- 
ideal gas to a two-body problem, a three-body problem, etc. Such a decomposition 
is not applicable to a liquid since each molecule in a liquid is in constant interaction 
with a large number of its neighbors. We shall, therefore, have to use the techniques 
that are more suitable for dense systems The central idea in most theories of liquids 
is the radial distribution function, and this chapter is devoted to a discussion of this 
function and a number of its close relatives. 

The radial distribution function is defined in Section 13-2. Once having shown 
how to calculate thermodynamic functions in terms of the radial distribution func- 
tion (in Section 13-3), we then derive an equation, the so-called Kirkwood equation, 
for the radial distribution function itself (Section 13-4). This equation is not exact, 
and so the rest of the chapter (except for the last sections) discusses the derivation of 
several other equations for the radial distribution function. These equations are 
derived quite differently from the Kirkwood equation, and Sections 13-5 and 13-6 
are needed to provide the background for Section 13-7, in which two of the other 
equations are derived. Section 13-8 discusses the low-density limit of the various 
liquid theory expressions, and the last section is a comparison of the various results 
to experimental data. 

It should be possible for someone who wants only a brief introduction to the theory 
of liquids to skip Sections 13-5 through 13-8. He simply must accept that two new 
equations, the hypernetted chain and the Percus-Yevick, are derived in these sections. 
These equations are somewhat similar to the Kirkwood equation, which is derived 
in Section 13-4. 
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13-1 INTRODUCTION 

The virial expansion has a radius of convergence beyond which the series no longer 
represents the pressure. This point possibly has something to do with the onset of a 
liquid state in which the interactions can no longer be treated as a sequence of two- 
body, three-body interactions, etc. A theoretical treatment of liquids, therefore, 
requires us to seek new methods that are more directed to the many-body nature of 
liquids. Before going on to study some of these methods, however, let us see how 
much one can learn about dense gases and liquids from a knowledge of just the first 
few virial coefficients. Figure 13-1 shows a set of pressure-volume isotherms both 
above and below the critical temperature. To what extent can we reproduce curves 
like this with only a few virial coefficients? This question has been studied by Ree 
and Hoover.* * * § They have used the first five or six hard-sphere virial coefficients to 
calculate equations of state and have compared these to 44 experimental data ” on hard- 
sphere fluids. These 44 data M are actually the results of solving the equations of motion 
of several hundred hard spheres numerically on a large computer and then calculating 
observable macroscopic quantities by time averaging the appropriate microscopic 
equations. Such calculations are called molecular dynamics calculations and have 
been pioneered for hard spheres by Alder and Wainwrightf and later extended to a 
fluid of molecules obeying a Lennard- Jones 6-12 potential by Rahmanf and Verlet.§ 



K = Vfv c 

Figure 13-1. Pressure-volume Isotherms of a real fluid. 


> 


* F. H. Ree and W. G. Hoover, J. Cherh. Phys., 40, p. 939, 1964; ibid. 46, 4181, 1967. 

t See, for example, B. J. Alder and T. Wainwright, in Transport Processes in Statistical Mechanics , edited 
by I. Prigogine (New York: Academic, 1958). 

t A. Rahman, Phys. Rev., 136, p. 405, 1964. 

§ L. Verlet, Phys. Rev., 165, p. 201, 1968. 
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Figure 13-2. Plot of pV/(NkT) versus V Q IV for hard spheres. V 0 is the volume at closest packing, 
N(j 3 /^2. The curves are: (1) virial series including B s , (2) virial series including B 6 , and 
(3) a Pade approximant. Molecular dynamics results of Alder and Wainwright are 
indicated by dots. (From F. H. Ree and W. G. Hoover, J. Chem. Phys ., 40, p. 939, 1964.) 

Figure 13-2 shows the equation of state of a hard-sphere fluid calculated by Alder 
and Wainwright. It is interesting to note that the molecular dynamics calculations 
seem to indicate that a system of hard spheres exhibits both a fluid and a solid branch, 
with the transition occurring at around a volume v = 1.6r 0 » where v 0 , the closest- 
packed volume of a system of hard spheres, is equal to Nc^jyfl. On the same graph 
is plotted the equations of state obtained by truncating the virial series at the fifth and 
sixth virial coefficient. 

The agreement of these truncated virial expansions is satisfactory up to a reduced 
density of about 0.5. It is well recognized in numerical analysis, however, that taking 
just the truncated power series of a function is an unsatisfactory method of approx- 
imating it and more sophisticated techniques are available. One such method is that 
of Pade approximants,* which simply represents the function by a ratio of poly- 
nomials, whose coefficients are found by expanding this ratio in a power series and 
then requiring the first n terms to correctly give the firsts Taylor expansion coefficients 
of the function itself. For example, suppose we know the first n terms in the expan- 
sion of some function f(x ), i.e., we are given 

f(x) =a 0 + a x x + ••• + a n y? + 0(x" +I ) (13-1) 


* A. Ralston, A First Course in Numerical Analysis (New York: McGraw-Hill, 1965). 
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We now construct the so-called N , M Pad e approximant to this function by writing 
c 0 + c x x + ••• +C N _ 1 ** _1 


m 


1 d^x -f • • • + * 


03-2) 


and then finding the c’s and d’s by expanding this in x and requiring the first n + 1 
terms to be the same as those in Eq. (13-1). Clearly, the number of Pade coefficients 
to be determined, M -f N — 1, must equal the number of known Taylor expansion co- 
efficients, n+ 1. This direct expansion of the denominator in Eq. (13-2) can become 
quite laborious, and fortunately the evaluation of the c’s and d ’ s can be written as a 
matrix problem. The appendix of the paper by Ree and Hoover* gives the matrix 
recipe. 

Ree and Hoover* used the first six hard-sphere virial coefficients to construct the 
3, 3 Pade approximant: 



(1 + 0.0635076o P + 0.01 7329* 0 V) 
° P (1 - 0.561493fco P + 0.239465V p 2 ) 


(1 3-3) 


Figure 13-2 shows that this gives excellent agreement with the molecular dynamics 
calculations all along the fluid branch. 

Fourth and fifth virial coefficients have been calculated for the Lennard-Jones 
potential by Barker and Monaghan.*)* They showed that the truncated virial series 
gives critical constants in fairly good agreement with experimental data. The results 
are shown in Table 13-1. 


Table 13-1. The reduced critical parameters obtained from a truncated Lennard-Jones virial series* 



T* 

P* 

Pc*lpc*T* 

virial series through B 3 

1.449 

0.771 

0.339 

virial series through 

1.300 

0.561 

0.347 

virial series through B s 

1.291 

0.547 

0.352 

experimental values 

1.29 

0.66 

0.29 


* The temperature is reduced by elk and the volume by 2irNa 3 /3. 


All of this is fine, but it really avoids the true nature of liquids, namely, that each 
molecule is in constant interaction with all of its neighbors. This makes the virial 
expansion technique not applicable, and we must abandon it and treat the problem by 
methods that are more suitable for dense systems. The central idea in most theories of 
liquids is that of the radial distribution function, which we shall now discuss. 


13-2 DISTRIBUTION FUNCTIONS 


Consider a system of N particles in a volume V and at a temperature T. The prob- 
ability that molecule 1 is in dx 1 at r u molecule 2 in dr 2 at r 2 , etc., is given by 


P m (r lt ..., T „)dr 1 ■■dr N = 


e dr 1 -'- dr 


N 


'TV 


(13-4) 


where Z N is the configuration integral. The probability that molecule 1 is in dt x at 
r l9 . . . , molecule n in dr n at r„ , irrespective of the configuration of the remaining 


* F. H. Ree and W. G. Hoover, /. Chem. Phys., 40, p. 939, 1964. 

t J. A. Barker and J. J. Monaghan,/. Chem. Phys., 36, p. 2564, 1962. 
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N — n molecules is obtained by integrating Eq. (13-4) over the coordinates of mole- 
cules n + 1 through N: 




j... j g fiVK dr„ +1 ■dt N 


( 13 - 5 ) 


Now the probability that any molecule is in dr x at r x , and any molecule is in 
dr n at r„ , irrespective of the configuration of the rest of the molecules, is 


iSM 

p ( ”>(r x , . - • , r„) = • /’‘"Vi. • • • . O (1 3-6) 

This comes about since we have N choices for the first molecule, N — l for the second, 
etc. 

The simplest distribution function is p (1) ( r t ). The quantity p (1) (ri) is the prob- 
ability that any one molecule will be found in dr t . For a crystal this is a periodic 
function of r x with sharp maxima at the lattice sites, but in a fluid all points within V 
are equivalent and so p (1) (r 1 ) is independent of For a fluid, therefore, we can 
write 


j P (1) ( Ti) dt x -- p (1) = ^ = P 


(fluid) 


(13-7) 


where we have used Eqs. (13-4) and (13-6) to equate the integral of p (1) (n) to N. 
We now define a correlation function p <B) (ri r„) by 


p <M) (n, •••,!■„)= P n 9 (n) (r u . . . , r„) 


( 13 - 8 ) 


g {n) is called a correlation function since if the molecules were independent of each 
other, p (w) would equal simply p", and so the factor g {n) in Eq. (13-8) corrects for the 
“ nonindependence ” or, i.e., the correlation between the molecules. 

Using Eq. (13-6) we see that (Problem 13-7) 


p (n) (r„ ...,r„) 


V n N\ 


! Z, 


N"(N — n ) ; ^ 

- V\l + 0(N~ 1 )) f‘ "J c fV ” dT ” +1 


dr 


N 


( 13 - 9 ) 


We shall see shortly that p (2) ( r x , r 2 ) is particularly important since it can be determined 
experimentally. In a liquid of spherically symmetric molecules, p (2) ( r t , r 2 ) depends 
only upon the relative distance between molecules 1 and 2, i.e., upon r 12 . We shall 
usually denote r 12 simply by r, and since we shall concentrate mostly on g {2 \r), we 
shall therefore drop the superscript and write p (2) (r 12 ) = g(r). This is a standard 
notation. 

Now pg(r) dr is the “probability” of observing a second molecule in dr given that 
there is a molecule at the origin of r. Note that this “ probability ” is not normalized 
to unity, but we have instead 


-00 

pg(r)4nr 2 dr = N — 1 « N (13-10) 

In fact, Eq. (13-10) shows that pg(r)4nr 2 is really the number of molecules between 
r and r + dr about a central molecule. The function g(r) can also be thought of as the 
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Figure 13-3. The radial distribution function of a fluid of molecules obeying a Lennard- Jones 6-12 
potential from molecular dynamics calculations. T* — kT/e and p* — cr 3 p. 

factor that multiplies the bulk density p to give a local density p(r) = pg(r ) about some 
fixed molecule. Clearly p->0 as r->0 since molecules become effectively “hard” as 
r 0. Also since the influence of the molecule at the origin diminishes as r becomes 
large, g-+l as r -* oo. g(r) is called the radial distribution function of the fluid. Figure 
1 3-3 show the radial distribution function for a Lennard- Jones 6-1 2 fluid as determined 
from the molecular dynamics calculations of Verlet.* 

The radial distribution function turns out to be of central importance in the theory 
of liquids for two reasons. First, we shall show in the next section that if we assume 
that the total potential energy of the iV-body system is pair-wise additive, i.e., if we 
write 

ty>i> •••,!>, ) = X u(r tJ ) (1 3-1 1 ) 

i<j 

where the summation goes over all pairs of molecules, then all the thermodynamic 
functions of the system can be written in terms of g(r). In addition, the radial dis- 
tribution function can be determined by X-ray diffraction studies on "liquids. In a 
solid the molecules are arranged in a regular repeating order, and this leads to a sharp 
X-ray diffraction pattern from which the order of the molecules in the solid can be 
obtained. The X-ray diffraction pattern of a liquid is more diffuse, but nevertheless 
can be used to determine the local short-range order in a fluid, i.e., the order exhibited 
in Fig. 13-3. The peaks in the curve represent smeared-out shells of first nearest 
neighbors, second nearest neighbors, etc., which in a sense are remnants of the ordering 
found in the solid. 

We can give an outline of the relation between the radial distribution function and 


L. Verlet, Phys. Rev., 165, p. 201, 1958. 
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the scattering of electromagnetic radiation. Problem 13-47 shows that the scattering 
through an angle 0 is given by 

^ 0 3 - 12 ) 

« J sr ij 

where s = (47r/2)sin(0/2). For a liquid the r tj are continuously distributed, and so 
Eq. (13-12) can be written in the form 

moc 

J 0 ^ 

It is convenient to write this as 

m * C*nr\g(f) - 1) — dr + C Anr 2 — dr (1 3-1 3) 

Jo sr Jo sr 

Now it can be shown that this last integral vanishes for s ^ 0 (Problem 13-5), and so 

r °o sin sr 

P(6) oc I 4nr 2 (g(r) — 1) dr 

Jo sr 

or 


P(6) oc J [g(r) — 1 ]e ,s r dr (1 3-1 4) 

which shows that the Fourier transform of g(r) — 1 is proportional to the scattering 
through the angle 0, or really 0/2 since | s | depends upon 0/2 through s = (47c/2)sin(0/2). 

The function g(r) — 1, which goes to zero as r-> 00 , is usually denoted by h(r), 
and the Fourier transform of Hr) multiplied by the density is called the structure 
factor. Thus we can write 

Kr)=g(r )- 1 03 - 15 ) 

and 

fi(s) = p J h(r)e is r dr (1 3-1 6) 

where we have denoted the structure factor by fi{s). 

The above derivation of the relation between X-ray scattering and the radial dis- 
tribution function is incomplete since Eqs. (13-12) through (13-14) omit factors of 
proportionality, and the reader is referred to excellent discussions by Warren,* 
Gingrich, t and KruhJ: for more detail. 

Since we can write thermodynamic functions in terms of g(r) (next section), it is 
therefore possible to determine thermodynamic functions of a fluid from X-ray dif- 
fraction studies. To complete a statistical mechanical theory of liquids, however, we 
shall derive equations that give g(r) from first principles in Sections 13-4 to 13-8. 


* B. E. Warren, /. AppL Phys. t 8 , p. 645, 1937. 
t N. S. Gingrich, Rev. Mod. Phys., 15, p. 90, 1943. 
t R. E. Kruh, Chem. Rev., 62, p. 319, 1962. 



RELATION OF THERMODYNAMIC FUNCTIONS TO g(r) 


261 


13-3 RELATION OF THERMODYNAMIC FUNCTIONS TO g(r) 

We first derive an equation for the energy E in terms of g(r). Using the equation 
Q n = Z n IN\ A 3 *, we can write 

= %NkT+U (13-17) 

where 

p i -iUe~ fV dt 1 --dr N 


The first term in Eq. (13-17) is the mean kinetic energy and the second term is the 
mean potential energy. Now under the assumption of pair- wise additivity, Eq. (13-1 1), 
U is the sum of N(N — l)/2 terms, all of which give the same result upon carrying out 
the integration with respect to r x through r N . Using u(r l2 ) as typical of these N(N — l)/2 
terms in U, we have 


N(N-l) 

2Z„ 

N(N- 1) 
2 


| je pv u(r l2 ) dx 1 • • • dr N 


dr x dr 2 


1 

2 


JJ u ( r i2)P <2) ( r i> r 2 ) *, *2 


N 2 r°° 

= 7 VJ U ( r ^( r ) 4jtr2 dr 


( 13 - 18 ) 


The total energy E is then 


E 

Nkf 


l + 2kT 


f u(r)g(r, p, T)4nr 2 dr 
J o 


( 1 3 - 1 9 ) 


We have written g(r, p, T) explicitly as a function of p and T as well as r to empha- 
size its dependence on these variables. Equation (13-19) could have been derived 
more physically by first choosing any molecule as a “central” molecule. The inter- 
molecular potential energy between this central molecule and other molecules in the 
fluid at distances between r and r + dr is u(r) • pg(r) • 4nr 2 dr. The total potential 
energy of the fluid is obtained by integrating over all values of r and multiplying by 
Njl since any of the N molecules might be “ central.” The factor of two is inserted 
so that each pair interaction is counted only once. This gives Eq. (13-18) for U. 

Next we shall consider the pressure. For large V the pressure is independent of the 
shape of the container, and so for convenience, then, we assume that the container is 
a cube. The pressure p is given by 


P 



(1 3-20) 


where 
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Before differentiating Z N with respect to K, we change variables of integration in such 
a way that the limits become constants and U becomes an explicit function of V. 
Let the new variables be y /, etc., where 

x K = K I/3 V» etc. 

Then 

z » = yN ( f e~' v dxS — dz„' 

J o J o 

U= £ u(r u ) (13-21) 

1 <i<j<N 

and 


F ij 


= K*i - xj) 2 + 0; - yjf + (z t - Zj) 2 ] 112 
= - x/) 2 + O'/ - y/f + (*,' - z/) 2 ] m 


Therefore, 


(13-22) 



where 


L 


e pv dx x ••• dz 


N 


Yl[ x 

kTJ 0 



dx t ' ••• dz N ' 


dU\ y-i du(rij) df*ij y f"i j dufaij) 

fiVf ~ i <i<j<N drij dV i<i<j< N 3V dr X j 


Having carried out the differentiation with respect to V, we now transform back to 
the original variables x x , . . . , z N . Also note that on integrating over the sum, N(N — l)/2 
identical terms result. Therefore we finally get 


(dlnZ N \ N 1 ff du(r l2 ) (2V _ _ w _ ^ 

\~W~J „ T ~v~ evkrn tl 2 ~a~ p (r *’ 2) dTl * 2 


V ~ 12 

which, when substituted into Eq. (13-20), gives 


p p * 00 

kf = P ~6kf< 0 


f ru\r)g(r)4nr 2 dr 
J A 


(13-23) 


This equation is often called the pressure equation. 

We have derived Eq. (13-23) with a liquid in mind, but it is valid for a gas as well 
since a liquid becomes a gas in the limit of low density. In fact, one could call Eq. 
(13-23) a fluid equation of state. In the previous chapter we have derived the equation 
of state of a gas as an expansion in the density p. In order to recover the virial 
expansion from Eq. (13-23), we write g(r 9 p y T) in the form 

g(r, P,T)= g 0 (r, T) + pg x {r, T) + p 2 g 2 (r, T) + •■• 
and substitute this into Eq. (13-23) to get 

h = p -£cr%f C" W' T *" r> * 


(1 3-24) 
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By comparing this to Eq. (12-5), we see that 
B j+2 (T) = ru'irtejir, T)4nr 2 dr 


( 1 3 - 25 ) 


In particular, if we compare this to the first equation in Problem 12-10, we see that 

9 o{r,T)=e- pu (r) ( 13 - 26 ) 

i.e., a simple Boltzmann factor type of expression. Note that this is obtained only in 
the limit of low density. We shall see in Section 13-4 that the energy factor in a Boltz- 
mann expression is not just a potential energy when the system is dense. 

Equation (13-25) has been employed a great deal in the literature to attempt to 
assess the relative merits of various approximate radial distribution functions. The 
procedure is to develop a density expansion of the approximate g(r), substitute this 
into Eq. (13-25), and compare the resultant virial coefficients to the exact ones de- 
termined by the methods of Chapter 12. We shall consider this in some detail in 
Section 13-8. 

We must calculate one last thermodynamic function in order to be able to cal- 
culate all the others. Notice that the two that we have dealt with up to now are both 
mechanical thermodynamic properties. We must calculate one nonmechanical thermo- 
dynamic property, and this is customarily taken to be the chemical potential. This is 
not as straightforward as the calculation of the two mechanical properties, however. 
We could use the thermodynamic equation 



( 13 - 27 ) 


along with Eq. (13-19) for £, but this would require g(r 9 p, T) as a function of T 9 
something which is not available at present (Problem 13-6). 

An alternate method to Eq. (13-27) is to introduce a so-called coupling parameter 
£ 9 which varies from 0 to 1 and which has the effect of replacing the interaction of 
some central molecule, say 1, with the y'th molecule of the system by £u(r lj ). 

In terms of this coupling parameter, then. 


l/(r„...,r„,0= I Mg + I u(r tJ ) 

j=2 2 <i<j<N 


( 13 - 28 ) 


Note that we can take molecule 1 in and out of the system by varying f from 0 to 1. 
This is a useful thing to be able to do since we can write the chemical potential p in 
terms of Z N and Z N - U We say that molecule 1 is “coupled” to the system to the 
degree <!;. All the other molecules of the system interact normally. The radial distribu- 
tion function of the system now depends upon ^ and is written g{r 9 p 9 T; f). Of course, 
g(r 9 p 9 T; 0 cannot be determined experimentally, but we shall derive theoretical 
expression for it. 

Since N is very large, we can write essentially rigorously that 


and since 


( 13 - 29 ) 


= In Z N — In N\ — 3N In A 
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we get 


— = In — In N - In A 3 

kT Z w _! 


(1 3-30) 


The ratio of configuration integrals can easily be written in terms of the coupling 
parameter, as we shall now show. Clearly 

Z W (£ = 1)=Z W 

and 

Z/i(£ =0) = J'Ztf-i (13-31) 

The factor of V in = 0) comes from the integration over dr t . Therefore 


^S- 1 

-as*""- 

(1 3-32) 

where 


z N (0 = 

f . . . f € -pun{Z) . . . dr N 

(1 3-33) 


Now, from Eqs. (13-28) and (13-32), 


dz N 


- - 2; J J [!»(>■„)] *, • • ■ • *, 


dt kT 

By dividing this by Z N and collecting the N — 1 identical integrals, we can get 
d In Zff 


it 

= u (rMn 04w 2 dr 


kT •'o 

Putting this back into Eq. (13-32) gives, then, the final result 

I 1 *°° 

kT Jq •’o 


Vr = to pA ‘ /><- ir dt 


(1 3-34) 


From E , p, and p we can get all of the other thermodynamic functions. We now 
need only some equation for g(r) [or g(r; <*)] itself to have a complete theory of the 
liquid state. It is at this point that we run into difficulty. There have been a number of 
approximate equations derived, and we shall now look at the derivation of some of 
these and see the basis of each one. 


13-4 THE KIRKWOOD INTEGRAL EQUATION FOR g(r) 

One of the first and still one of the most important equations for g(r) was derived 
in the 1930s by Kirkwood and is called the Kirkwood equation. To derive the Kirk- 
wood equation we start with Eq. (13-6): 


P<”>(1, 2 0 


Nl 


J • • • J e~ fv( ® dr„ +l • • - dr N 

(N — n)\ Z w (0 


(1 3-35) 
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where we have written the argument of p (n) as 1, 2, . . ., n instead of r l9 r 2 , . . r„. 
We first differentiate this with respect to ^ to get 


kT 


dp 


(n) 


An) N 


E J j* j) dr r -- dr N 


dt Z N (0 fa 

~ (N — n) \ % 2 i" ' ’ " / ' * >V<0u(ri j) dr " + >"‘ dT » 


(1 3-36) 


The first term here comes from differentiating the denominator of Eq. (13-35), and 
the second term comes from differentiating the numerator. The first integral in Eq. 
(13-36) can be written as 


z N (0 

N(N - 1) 


JJ u ( r ij)p (2> ( r i> r j) dr i dij 


The second integral in Eq. (13-36) involves two separate cases. For j = 2, . . 
u ( r ij) can be taken outside the integral, giving 


Z u ( r ij) • 
J= 2 


Nl J 
(N — n)\ 




z«<0 


P (B) ( 1. 2, E u(r u ) 

J = 2 


n. 


For j = n + 1, . . • , N, we have 

J u ( r u)(j' • J e~ fiV dr n+l • • • dr J _ 1 dr J+l ■ ■ ■ dr^j dr s 

Z n (0(N -w— 1)1 j l/(rj . )p(n+ 1 > ( 1 ) 2, . . . , n,j, 0 dtj 


N\ 


Putting all this together and dividing by p (n) (l, 2, .... n) gives 
d In p (n) 

K 


kT 


- . = ~£ u(rij) + iff “Oi 2 )p (2) ( f j ,* 2,0 dr i dr 2 

OQ j= 2 iV y 

J K w(ri,n+l) p<">(U o dn+l 


(1 3-37) 


where we have taken account of the equivalence of the terms in the two summations. 
We now simply integrate Eq. (13-37) from 0 to £;, noting that [see Eq. (13-35)] 

„(«,n „ m = m f Jexp (-PU N . t )dr n+i -dr N 

P *’• ’ > (N-ri)'.V J | exp(— dr 2 • • * dr N 

= PPn~iK t 2 » • • • » r „) ( 13 - 38 ) 

where here denotes a distribution function in a system containing N — 1 mole- 

cules. The integration gives us 

kT In p (n) (l, . . . > n, f) = kT In p + kT In pJjCj \2, . . . , n) 


- <ZE w ( r u) +^J JJ u ( r i2)P (2) ( r i> r 2 » 0 dr t dr 2 d£ 


j=2 


p (B+1) (l,...,n,/H- 1, 0 


r f /- xP UU •••>*,* 

-JoR^ p w (i, 


o 


dr n+t d{ 

(13-39) 
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As it stands, this equation is applicable to any dense system. We now specialize this 
to the case of a fluid. Let n = 2 and introduce g (2 \r i2 ) to get 


■<7 (3) 0, 2, 3, 0 


-kT In 2, f) = ft/(r 12 ) + p JVr 13 ) [■ (2 )^ 2 q 

-^ 2> (l,3,0]* 9 dS ( 13 - 


-40) 


Notice that this gives g m in terms of g {3) or, in general, gives g^ n) in terms of g {n+1) . 
Such a set of coupled equations is called a hierarchy. It is exact, but unfortunately 
not very usable as it stands. If we could derive some other relation for < 7 (3) in terms of 
g w we could “uncouple” this hierarchy and have a “closed” equation for g w . 
This apparently cannot be done exactly. A further examination of g in) , however, will 
suggest an approximation which can be used to uncouple these equations. 

Define a quantity w (n) (ri> . . . , r n ) by 

g {n \t u . . . , r n ) = e ~ pwinHru - rw) (1 3 - 41 ) 

Substitute this into the defining equation for g itt) [Eq. (13-9)], take the logarithm of 
both sides, and then take the gradient with respect to the position of one of the rt 
molecules, 1, . . . , n. This gives 


-V,.w (w) 


J e~ fiU (—Vj U) dt„+i • • • dtf, 
f ie~ pv dr„ +l •■■dr N 


j = 1. 2, 


,n 


(13-42) 


Now — Vj U is the force acting on molecule j for any fixed configuration r l9 . . . , t N , 
and so the right-hand side is the mean force f/ n) acting on particle j\ averaged over the 
configurations of all the n + 1, . . . , N molecules not in the fixed set 1, . . . , n. Thus 


// w) = -V^w 00 (13-43) 

This says that w (n) is the potential that gives the mean force acting on particle /, or, 
i.e., w (n) is the potential of mean force . In particular, w (2) (r 12 ) is the interaction be- 
tween two molecules held a fixed distance r apart when the remaining N — 2 molecules 
of the fluid are canonically averaged over all configurations. Figure 13-4 shows the 
typical behavior of w (2) (r) = w(r) for a dense fluid. When the density becomes very 
small, the two molecules fixed a distance r apart are not affected by the remaining 
N — 2 molecules, and so w (2) (r) u(r) as p -► 0, in agreement with Eq. (13-26). Note 
that in a dense system, however, the energy term in a Boltzmann factor is the poten- 
tial of mean force rather than just the intermolecular potential. In a sense, one enters 
the realm of rigorous statistical mechanics when this distinction is clearly appreciated. 

We shall see that the radial distribution function of a fluid of hard spheres is quite 
similar to that of a more realistic system. This means that the curves shown in Fig. 
13-4 are also fairly representative of a hard-sphere system. Thus we see that although 
the hard-sphere potential itself has no attractive region, the corresponding potential 
of mean force does. The explanation for this can be seen by considering two hard 
spheres separated by a small distance and immersed in a bath of other hard spheres. 
The collisions that the right-hand sphere suffers with the other N — 2 spheres making 
up the bath will occur mostly from the right since the left-hand sphere of the pair is in 
the way of the bath spheres. There will be a net force to the left on the right-hand 
sphere of the given pair. Similarly, the left-hand sphere will be collided mostly from 
the left and so will experience a net force to the right. Thus the two spheres will be 
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Figure 13-4. The radial distribution function g{r) and the corresponding potential of mean force w {2 \r) 
for a dense fluid. Note that w (2) (r) has minima where g(r) has maxima and vice versa. 

driven toward each other due to the angular asymmetry of their collisions with the 
N — 2 bath spheres and so experience an effective attraction which manifests itself 
by the negative region in the potential of mean force. 

Suppose now that we assume that the potential of mean force for a triplet of mole- 
cules is pair-wise additive, i.e., we assume that 

w (3) (l, 2, 3) » w (2) (l, 2) + w (2) (l, 3) + w (2) (2, 3) (13-44) 

Then this gives us that 

2, 3) » g«\ 1, 2V 2 >(1, 3)*< 2 >(2, 3) (1 3-45) 

Equation (13-45) has a probabilistic interpretation as well as a potential of mean 
force interpretation. If we assume that particles 1, 2, and 3 are completely independent 
of one another, then # (3) (1, 2, 3) would equal # (1) (1)# (1) (2)# (1) (3), essentially the 
product of the separate “probabilities.” We see that Eq. (13-45) then assumes that the 
triplet correlation can be written as a product of pair-wise independent “ probabilities.” 
Substituting this approximation, called the superposition approximation , for # (3) 
into Eq. (13-40) gives finally the Kirkwood equation for g(r), namely, 

—kT in g(r l2 , 0 = frOiz) + P f f tKr t3 )g(r 13 ; C)[g{r 23 ) - 1] dt 3 d? (1 3-46) 

J 0 Jy 

Since the function to be determined in this equation occurs under an integral sign, 
an equation like Eq. (13-46) is called an integral equation. Thus Eq. (13-46) is an 
integral equation for g(r l2 ) and a nonlinear integral equation, in particular. The 
numerical solutions of this equation are difficult to obtain, but can be done on modern 
computers. We shall look at some of these solutions after we have derived a few other 
equations, but we can say here that the g(r l2 ) obtained from the Kirkwood and all 
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the other equations, in fact, are qualitatively satisfactory. Note that Eq. (13-46) 
gives that g(r l2 ) = exp[— &(r l2 )lkT] as p -► 0. 

There is another integral equation for g(r ) that is derived somewhat similarly to the 
Kirkwood equation. Instead of differentiating with respect to a coupling parameter, 
we differentiate with respect to the coordinates of some particular molecule. This 
gives a coupled hierarchy similar to the Kirkwood equations and must also be un- 
coupled by means of the superposition approximation. We shall not go through the 
details, but the result is called the Bom-Green-Yvon equation, which has the form* 

- — [kT In g(r, f) + fu(r)] 
dr 

r 00 r r+s (s 2 4- r 2 — R 2 } 

= n£p f u'(s)g(s, £)ds [ 5 Rg( R ) dR (1 3-47) 

The derivation of this equation is given through Problem 13-13. If one wishes to 
accept that the next few sections derive two alternative integral equations for g(r), 
the hypemetted-chain equation [Eq. (13-84)] and the Percus-Yevick equation [Eq. 
(13-81)], then he can go directly to Section 13-9 for a comparison with experimental 
data. 


13-5 THE DIRECT CORRELATION FUNCTION 


In the late 1950s a new class of integral equations was derived by methods quite 
different from those used to derive the Kirkwood and BGY equations. These new 
equations came not out of a hierarchy, but rather through consideration of another 
type of correlation function, the direct correlation function, which we introduce and 
discuss in this section. 

The distribution functions that we considered before were defined in a closed 
system. Since we are about to generalize these ideas to open systems, let us label these 
as p ! ...» r n ). Then the probability of observing rt molecules in dr 1 • • • dr„ at 
(r lf . . . , rj, irrespective of N, is 

P (n) = I ( 1 3 — 48 ) 

N>n 

where P N is the probability that an open system contains N molecules, namely, 
e^QjN, V, T) z"Z N 
N HO i,V,T) NIS 


Substituting Eqs. (13-5) and (13-6) into (13-48) gives then 


- 1 h"' 1 '- + X , <N^)i J ■ • • • *} 


( 13 - 49 ) 


where Uj denotes the total intermolecular potential of a system of j particles. Note 
that 


J-- Jp (n) (l,.. .,n)dr l ■■•dr„ = - £ 


■ h Z 


N 


N\ 


s ifi n ( N-n)\ £„ Pn ( N-n ) ! 



( 1 3 - 50 ) 


* T. L. Hill, Statistical Mechanics (New York: McGraw-Hill, 1956), Chapter 6. 
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If we put n = 2, we get 

JJ p< 2) ( 1, 2) dr, dr 2 = = <^(N - 1)> = N* - N (1 3-51) 

Furthermore, 

JJ p (1) ( r i)p (1, (r 2 ) dr, dt 2 = (JV) 2 ( 1 3-52) 

Subtracting Eq. (13-52) from (13-51) gives 

^pJJ [p (2) ( r i» r 2 ) - p (1) (ri)p (1) (r 2 )] dr, dr N = ^ ^ - 1 

= pkTK - 1 (13-53) 

where ic is isothermal compressibility [cf. Eq. (3-51)]. For a fluid, Eq. (13-53) becomes 

= 1+ p J [g(r) - 1 ] dr (1 3-54) 

This is the equation we set out to derive. This is essentially another equation for 
p in terms of g(r\ and it often is referred to as the compressibility equation . Note that 
Eq. (13-54) shows that the isothermal compressibility is related to the structure 
factor at s = 0 [cf. Eq. (13-16)]. Also note that the derivation of this equation does 
not require the assumption of pair-wise additivity. This is not so for the pressure 
equation, Eq. (13-23). 

We now introduce one final new function. Clearly h(r Y 2 ) = g(r Y 2 ) — 1 is a measure 
of the total influence of molecule 1 on molecule 2 at a distance r l2 . Many years ago 
Ornstein and Zemike (1914)* proposed a division of h(r l2 ) into two parts, a direct 
part and an indirect part. The direct part is given by a function c(r l2 ) called the direct 
correlation function. The indirect part is the influence propagated directly from mole- 
cule 1 to a third molecule, 3, which in turn exerts its influence on 2, directly or in- 
directly through other particles. This effect is weighted by the density and averaged 
over all positions of molecules 3. With this decomposition of A(r 12 ), we can write 

^ 12 ) = c 0i2) + P J c(ri 3 )/i(r 23 ) dr 3 (1 3-55) 

This equation is called the Omstein-Zernike equation and can be considered to be 
the defining equation of the direct correlation function. 

The direct correlation function does not seem to be as physically intuitive as g(r\ 
but it does have a simpler structure. (See Fig. 13-5.) The direct correlation function 
is important since, as we shall see, it is of much shorter range than h(r), whose “ tail” 
is almost entirely accounted for by the indirect term above. Equation (13-55) has 
the property that if we multiply through by e lk ' (r2_ri) and then integrate with respect 
to dr t and dr 2 , we get 

J /i(ri 2 )e lk ' ri1 *1 dr 2 = J c(r 12 )c <k ri2 dr t dr 2 

+ P JJJ c(ri 3 y k ( -- r -V I (r 23 ) dr, di 2 dr 3 (1 3-56) 


* This article is reproduced in H. Frisch and J. L. Lebowitz, The Equilibrium Theory of Classical Fluids 
(New York: Benjamin, 1964). 
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Figure 13-5. A comparison of the behavior of the direct correlation function and the radial distribution 
fuuction. Note that the direct correlation function has a much simpler structure and a much 
shorter range. 


If we denote the Fourier transforms of h(r ) and c(r) by /?(k) and C?(k), we have 
(Problem 13-14) 

/?(k) = C(k) + pfi(k)C(k) (1 3-57) 

[Anyone familiar with the properties of Fourier transforms would write Eq. (13-57) 
immediately from Eq. (13-55).] 

The compressibility equation [Eq. (13-54)] can be rewritten in the form 


i i 

kT [dpj T 1 +p\Hr)dt 1 + p/?(0) 
= 1 ~P jc(r) dr 


= 1 - pC{ 0 ) 


(13-58) 


where we have used Eq. (13-57) with k = 0. The above manipulations probably 
served as the motivation for Omstein and Zemike to invent Eq. (13-57), from which 
the defining equations for the somewhat physically obscure direct correlation function 
follows immediately. 

Before using the results of this section to derive the two other integral equations, it 
is useful and interesting to study the density expansions of g(r) and c(r). 


13-6 DENSITY EXPANSIONS OF THE 

VARIOUS DISTRIBUTION FUNCTIONS 

In Chapter 12 we derived a density expansion of the pressure by means of the grand 
canonical ensemble. We first found the pressure as a power series in the activity z 
and then were able to convert that to a power series in the density. We can use the 
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same procedure to calculate density expansions of the distribution functions. Equa- 
tion (13-49) already gives p (2) (n> r 2 ) as an expansion in z: 


/-“Vi-rJ-i {«»«-*»■■> + (13-69) 


with 


S - ,+ „?.N! Z » 


It is convenient at this point to factor a z 2 e /?M(ri2) out of Eq. (13-59) and write 


P (2) ( r i> r 2 ) 


Z 2 e -Pu{r l2 ) 




where the prime on U N indicates that the w(r 12 ) term has been omitted. Straightfor- 
ward expansion of this equation yields 

P i2 X*u r 2 ) = 1 + z|j e-M dt 3 - Z, J 

+ y { JJ e-**' dr 3 dr, + 2Z,» - Z 2 - Z, J </r 3 ) + ■ • ■] 

(1 3-60) 

If we assume pair-wise additivity and write exp(— f!U 3 ') as (1 +/i 3 )(l +/ 23 ) and 
recall that Z t = V , we find that 

Je PV * dr 3 —Zi = J{/l3 +fz3 +/l3/23)^ r 3 

Note that the coordinates of particles 1 and 2 are fixed in this equation; i.e., we do not 
integrate over these coordinates. Such particles are indicated by open circles in the 
graphs corresponding to the integrands in the above equation, which becomes in a 
graph notation 


je PU3 ' dr 3 —Z l = j{ /o + -HA }dr. 


(13-61) 


The next term in the expansion of p (2) (r x , r 2 ) can be evaluated in a similar manner 
(Problem 13-8), and one eventually gets 

p (2) (r!,r 2 ) =z 2 e -/?,,(r,2) ^l + z J{ a + cA + A }dr 3 

+ — JJ* {2 n + 2 r +2 D i + +2 n+2N 


+ 2 PI + 


} dr 3 dr 4 + • • • 


(13-62) 


Note that the graphs that appear in Eq. (13-62) are such that there is no /-bond 
between the two root points, but the graph would become at least singly connected if 
there were an /-bond there. This leads us to define modified cluster integrals by 

b t *(r l2 )=e- pu(r ' 2) 

e -P«{ri2) r p 

b,*(r 12 ) = — — j-jdr 3 -dr l+t l C f (/ +1) 1^2 


(13-63) 
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where the summation is over all graphs of / + 1 points, C t {l + 1), which do not have a 
link between the fixed particles 1 and 2 and which would become at least singly 
connected if particles 1 and 2 were linked. The two fixed particles, 1 and 2, are called 
root points ; the other / — 1 particles, which are integrated over, are called field points . 
With this definition, then, Eq. (13-62) takes on the simple looking form, 

P (2 Vi2) = W £W* ,+I 03-64) 

/= 1 

In the last chapter we derived an expression for z in powers of p, namely, 
z = p - 2b 2 p 2 + (8 b 2 2 - 3 b 3 )p 3 + ■ • ■ 

Substituting this into Eq. (13-64) gives 

p (2 W = A V + W - 4 VA)p 3 

+ (3f> 3 * - 12&2*A - 6A*A + 20 b t *b 2 2 )p* + • • ■ (1 3-65) 

The coefficients of each power of p can be considerably simplified, and one eventually 
can get (Problem 13-15): 

P (2 V 12 ) = exp[~ [p 2 + P 3 J A dr 3 

*©J (2 n + 4 + M + R ) rfr 3 

+ J(60 + 6S\+ 12 Q + 12\T)+6f}+6/i+& + 12$& 

+ 3Q + 12#}+ 12^+ 12#S+6^+6#}+6#}+3$&+3^ 

+ 121$ + 6#}+ 61$ + 6lf} + 3t$+ 61$+1$) rfr 3 dx^. dr 5 + ■ ■ 

(13-66) 

The integrals occurring in Eq. (13-66) are closely related to the integrals for the 
virial coefficients. Equation (13-66) suggests, and it can be proved in general, that all 
graphs that become stars when the line corresponding to f l2 is added appear in the 
expansion of p (2) (r 12 ), and, of course, g(r ) as well. Note that p m (r l2 ) ^ p 2 e~ pu(rtl) 
as p -►(). This is the same as w (2) (ri 2 ) w(r 12 ). 

The simpler integrals above can be calculated by the same method that we intro- 
duced in Section 12-5 to calculate the third virial coefficient. It we write 

ff(r 12 ,p, T) = g 0 (r l2 , T) + pg t (r 12 , T) + p 2 g 2 {r l2 ,T) + ••• 

then Eq. (13-66) shows that 

9oir l2 ,T) = e-^ 

and 


PiO-,2 , T) = e _/,u(r,2 > J/(r 13 )/(r 23 ) dr 3 


(1 3-67) 
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It is easy to evaluate gi(r l2 , T ) by the method of Fourier transforms (cf. Section 12-5). 
If we let y(t) be the Fourier transform of f(r); i.e., let 

V(0 = (27t)~ 3/2 J/(r)e _,t ' r dr 

/2\ 1/2 r 00 sin tr 


then one can show that (Problem 13-16) 

\/(r 13 )f(r 23 ) dr 3 = f {y(0} 2 ^i/ 2 ( r i2 0* 3/2 dt 

J r l2 J o 

where J l/2 (x) is a Bessel function of i order. 

It is shown in Section 12-5 that for hard spheres 

... 3 /2\ 1/2 (cos(ert) sin(cr)| 

*°-4) w*— 55H 


= — <T 


*^ 3 / 2(°’0 


(fff ) 3/2 

Substituting this into Eq. (13-68), we get 

r (2 ji\ 3I2 c 112 r« 

j/0-1 3 )f(r 23 ) dr 3 = —^Tj 2 ~ J n J 3l i(xy 3 i 2 (x)J 1/2 (x)x- 3 ' 2 dx 


(13-68) 


Katsura* has evaluated integrals of this type, and we find that (Problem 13-17) 
9i(r i2 , T) =0 0<r 12 <<r 

= 0 r 12 >2a (13-69) 

It is readily shown that this form for g t (r, T) yields the correct third virial coefficient. 
The integrals in g 2 (r 12 , T) can also be evaluated for a hard-sphere potential.! I n 
addition, McQuarrief has evaluated g t (r 9 T) and some of the terms in g 2 (r, T) for the 
square-well potential. 

Equation (13-66) gives us a density expansion for p (2) (r 12 ) in terms of integrals 
over graphs. Since p (2 \r l2 ) is closely related to g(r) and h(r), Eq. (13-66) is essentially 
the expansion for g{r) and h(r) as well. For example, by dividing by p 2 and subtracting 
unity, we get 

h(r 12 ) =/(r 12 ) + J A * 3 + ^- J(2 n + 4 R + M + R ) * 3 * 4 + • • •] 

(1 3-70) 

We can use this to derive a density expansion for the direct correlation function. 
Equation (13-55), the defining equation for c(r), is 

C (f 1 2) = ^( r 1 2) — P J C ( r 1 3)K r 23) ^ r 3 0 3 - 71 ) 


* S. Katsura, Phys . Rev., 115, p. 1417, 1959; 118, p. 1667, 1960. 
f B. R. A. Nijboer and L. van Hove, Phys. Rev., 85, p. 777, 1952. 
t E). A. McQuarrie, J. Chem. Phys., 40, p. 3455, 1964. 
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Now a repeated iteration of this equation, obtained by successively substituting 
c(r 0 ) from the left-hand side into the right-hand side, gives (Problem 13-22) 

c(r 12 ) = /i(r 12 ) - p J K r i 3 ,)K r 23 .) dr 3 + p 2 JJ h(r l3 )h(r 34 )h(r 42 ) dt 3 dr 4 . + • • • 

(13-72) 

If we represent an h - bond by a wavy line, then Eq. (13-72) can be written graphically 
as 


c(r 12 ) = — - p J A dt 3 + p 2 JJ fl dr 3 di 4 - p 3 JJJ O dr 3 dr 4 dr 5 + ■■■ 

(13-73) 

If we now substitute Eq. (13-70) for h(r tj ) into Eq. (13-72), we get a density expansion 
for the direct correlation function, c(r): 

c(r, 2 ) = — + p J A dt 3 

+ p 2 JJ[ +iM+20+iH + iW + ia]dr 3 dr 4 + --- (13-74) 

Note that c(r 12 ) is indeed a much shorter range function than h(r l2 ) since the graphs 
appearing in c(r 12 ) are much more tightly bound. All diagrams in c(r 12 ) are without 
nodes , i.e., a field point at which the graph can be separated into two or more parts 
by cutting at that point. 

Substitution of Eq. (13-66) for p (2) (r 12 ) into the pressure equation [Eq. (13-23)] 
yields the virial expansion, although this is not so easy to show. The substitution of 
the expansion for c(r 12 ) into the compressibility equation [Eq. (13-58)] yields the 
virial expansion in a direct way (Problem 13-23). 

It is possible to extend the results of this section to higher-order correlation func- 
tions as well.* For example, one can show that (Problem 13-24) 

9° ( r i> r 2 > r 3 > = exp{-£[w(r 12 ) + u(r 13 ) + w(r 23 )]} |l + |p\(l, 2, 3)J 

(13-75) 

where 

T,(l, 2, 3) = g w (r l2 ) + g w (r l3 ) + g w (r 23 ) + J/K dr 4 (1 3-76) 

Uhlenbeck and Ford (in “Additional Reading ”) give expressions for the higher-order 
terms. 

13-7 DERIVATION OF TWO ADDITIONAL INTEGRAL EQUATIONS 

The direct correlation function and the Omstein-Zernike equation can be used 
to derive two additional integral equations which appear to be more satisfactory than 
either the Kirkwood or the Born-Green-Yvon equations. They were both derived in 
the late 1950s by methods that are quite different from these we shall use below. One 
of the equations, called the Percus-Yevick equation,! was originally derived by means 

* E. E. Salpeter, Ann. Phys. 9 5, p. 183, 1958; sec also D. Henderson, J. Chem. Phys. 9 46, p. 4306, 1967, for 
for a readable discussion of these expansions. 

f J. K. Percus and G. J. Yevick, Phys. Rev. 9 110, p. 1, 1958. 
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of field theoretic techniques and later derived by Stell* * * § by a detailed analysis of the 
graphical expansions of g(r) and c(r). He showed that the Percus-Yevick equation can 
be obtained by ignoring certain types of clusters that occur in c(r). The other equa- 
tion, called the hypemetted-chain equation, results from the work of a number of 
authors, but the initial impetus can be attributed to Rushbrooke and Scoins,| who 
derived a simpler equation (called the netted-chain equation) by a graphical analysis 
of c(r) and the retention of only certain types of graphs. The names of the resulting 
equations are a reflection of the types of graphs retained. 

The graphical analysis involved in the derivation of the Percus-Yevick and hyper- 
netted-chain equations is quite involved and will not be presented here,* but it played 
an important role in assessing the success of the two integral equations. Later, in the 
1960 s, these equations were derived by an entirely different technique involving the 
theory of functionals.} A functional differs from a function in that whereas a function 
is a mapping from one number or set of numbers into another, a functional is a 
mapping of a function into a number. More precisely, we say that z is a functional of 
the function x[t) in the interval (a, b) when it depends on all the values taken by x(t) 
throughout the interval. A simple example is 

J[*(0] = f *(0 dt 

One that is relevant to statistical mechanics is A[u(r)]; i.e., the Helmholtz free energy 
is a functional of the intermolecular potential u(r). It is possible to develop a calculus 
of functionals, § including such operations as functional differentiation (“ differentiating 
with respect to a function”) and functional Taylor expansions. This is a beautiful 
technique and can be used to derive not only the Percus-Yevick and hypernetted-chain 
equations but most of the other integral equations of liquid theory as well. It is not 
too difficult a technique, but it would take too much space here to present the neces- 
sary calculus of functionals. Several books dealing exclusively with the statistical 
mechanical theory of liquids present this powerful technique. || In addition, the reader 
is referred to a paper by Verlet, 1 f in which he reviews the basic ideas and derives an 
extended Percus-Yevick equation, known in the literature as the PY II equation. 

We shall use neither the theory of graphs nor the theory of functionals, but for 
pedagogical reasons we shall derive the PY and HNC equations by appealing to a 
physical interpretation of the direct correlation function. The Ornstein-Zemike 
equation 

^12) = c(r l2 ) + P J <r l 3 )h(r 23 ) dt 3 

can be considered to be a definition of the direct correlation function or, alternatively, 
can be considered to be a relation between h(r) and c(r). If c(r) were given in terms of 
g(r) or h(r), say, its substitution into the Ornstein-Zemike equation would give a 
closed integral equation for h(r). This can be done in the following approximate way. 


* G. Stell, Physica, 29, p. 517, 1963; see also Stell’s chapter in H. Frisch and J. Lebowitz, ed.. Classical 
Fluids (New York: Benjamin, 1964). 

t G. S. Rushbrooke and H. I. Scoins, Proc. Roy . Soc., 216A, p. 203, 1953. 

t J. K. Percus, Phys. Rev. Lett ., 8, p. 462, 1962. 

§ V. Volterra, Theory of Functionals (New York: Dover, 1959). 

|| G. H. A. Cole, Statistical Theory of Classical Simple Dense Fluids (New York: Pergamon, 1967); S. A. Rice 
and P. Gray, Statistical Mechanics of Simple Liquids (New York: Interscience, 1965). 

H L. Verlet, Physica , 30, p. 95, 1964. 
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The direct correlation function was introduced to represent in a sense the direct 
correlation between two particles in a system containing N — 2 other particles. It is 
reasonable to represent this direct correlation function by 


) 9 total ) 9 indirectCO 


(13-77) 


where £ total (r) is just the radial distribution function itself; i.e., g(r) =exp[— /ftv(r)], 
and <7indirect( r ) is the radial distribution function without the direct interaction u(r) 
included; i.e., we write <7 in direct( r ) = exp{— /?[w(r) — w ( r )]}- Thus we approximate 

c(r) by 


c(r) = c _/,w(r) 


e ~PlMr)-ul r)] 


(13-78) 


It is convenient to introduce one last function at this point. We define y(r) by 


yir) = e^g{r) 


(13-79) 


By comparing this definition to Eq. (13-66), we see that y(r) is defined to go to unity 
as the density goes to zero. It simply eliminates the factor exp (—/to) that occurs in 
Eq. (13-66). In terms of y(r) then, the direct correlation function is 


c(r) =g(r)-y(r ) 

= e~ pu y(r) — y(r) 

—f( r )y( r ) (PY) (13-80) 

This approximation for c(r) is labeled PY since if it is substituted into the Ornstein- 
Zemike equation, one obtains the so-called Percus-Yevick equation: 

y(r t2 ) = 1 +pj f(r l3 )y(r l3 )h(r 23 ) dr 3 (1 3-81 ) 


Note that the direct correlation function given by Eq. (13-80) is a short-range func- 
tion, namely, the same range as /(r) or u(r). 

The hypemetted-chain equation is derived by expanding the # indir< . ct (r) term in 
Eq. (13-78) and writing 

c(r) = e~ 0w - 1 + p[w(r) - u(r)] 

= ff(r) - 1 — In y(r) (1 3-82) 

=f(r)y(r) + [yir) - 1 - In y{r)) (HNQ (1 3-83) 


As the label indicates, if this approximation for c(r) is substituted into the Ornstein- 
Zemike equation, one obtains the hypemetted-chain equation: 




-/[■ 


/!(ri 3 )-ln g(r , 3 ) - 


^( r 13) 

kT _ 


lffO-23) - 1] dr 3 


(1 3-84) 


Although it may not appear so at first sight, this form of c(r) is short ranged since f(r ) 
is short ranged and the bracketed term becomes quite small as r increases much beyond 
the range of the intermolecular potential (Problem 13-26). 

This gives us two more integral equations for the radial distribution function, giving 
a total of four. There are actually a number of others that have been presented in the 
literature, but we shall consider only the four that we have derived so far. We shall see 
in Section 13-9 that it is difficult to compare these equations. They are all qualitatively 
satisfactory, and no one seems to be quantitatively best for all systems under all 
conditions. We shall come back to this in Section 13-9. 
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It is easy and interesting to see what graphs occur in the Percus-Yevick approxima- 
tion to y(r). Write the h(r 23 ) appearing in the Percus-Yevick equation as 

Kr 23 ) = <7023) - 1 = e~’'™y(r 23 ) ~ 1 =/0 23 )^ 3 ) + y(r 23 ) ~ 1 

and substitute this into Eq. (13-81) to get 

y(r i2 ) = 1 + P jf(r l3 )y(r t3 )f(r 23 )y(r 23 ) dx 3 + p j f(r i3 )y(r l3 )ly(r 23 ) - 1] dr 3 

(1 3-85) 

This equation may be solved by iteration to get a series in p, whose first few terms are 
(Problem 13-27): 

y(r) = 1 + p J A dt 3 + p 2 J | fl + PI + N J dt 3 dr 4 + • • • (1 3-86) 

Percus and Yevick proved by simple induction that the diagrams that occur can be 
obtained from the following recipe. Draw a simple chain from base point 1 to base 
point 2 as a plane polygon of ordered vertices (omitting the / 12 bond). Include in the 
sum this simple chain and all diagrams that can be formed from it by adding interior 
lines that do not cross. This expansion turns out to include fewer graphs than the 
y(r) from the hypemetted-chain equation,* but nevertheless we shall see that the two 
equations compare favorably. 

Up to now we have derived the four most popular integral equations for p(r). Notice 
that the Kirkwood equation and the Born-Green-Yvon equation both arise from a 
hierarchy which is broken by the superposition approximation. Presumably both of 
these equations are as good as the superposition approximation. The HNC and the 
PY equations, on the other hand, can be obtained by a variety of methods, all of which 
involve the direct correlation function and the Ornstein-Zernike equation. In the next 
sections we shall compare these four equations to some known exact results and to 
experimental data. 


13-8 DENSITY EXPANSIONS OF THE 
VARIOUS INTEGRAL EQUATIONS 

All four of these equations are nonlinear integral equations and so are rather difficult 
to solve even numerically. Only in the past few years have numerical solutions begun 
to appear. 

There is another way to try to assess these equations that is much easier, however. 
These are equations for liquids. As we let the density p become small, these equations 
should be applicable to dense gases. In the last chapter we treated dense gases 
exactly, and so we can compare the results of a density expansion of our approximate 
liquid theory equations with the exact equations of imperfect gas theory. Let us take 
the Percus-Yevick equation as an example. Write y{r) in a density expansion, 

y(r) = e pu g{r) = 1 + pg t (r) + p 2 g 2 (r) + • • • (1 3-87) 

where the p/r) are as yet undetermined. If we substitute this into both sides of the 

* Rowlinson (in “Additional Reading”) has an excellent, readable discussion of the graphical analysis of 
the PY and HNC equations. 
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Percus-Yevick equation and compare like powers of p on both sides, we get a set of 
simple equations for the gj(r): 

Pffl(ri2 ) + P 2 S2('l2> + • • • 

= P J/13U + W1O13) + O {p 2 ))\e-^ + pe~ Pu23 g t (r 23) + O (p 2 ) - 1 ] dr 3 

which gives 

^ 1 (^ 12 ) = J/ 13/23 * 3 = J A dr 3 
{hi r 12 ) ~ J/i 3 #1 ( r i 3 X /2 3 ds 3 4 - J /13 e P u ( r **)g 1 (^* 2 3 ) dr 3 

= JJ/i 3/1 4/34/2 3 ^3 ^ r 4 + jj/i3(l +/23X/24/34 ^ f 3 ^ r 4 

= JJ fl Jr 4 + 2 JJ N Jr 3 dr 4 

The middle line of Eq. (13-89) is obtained by using 

9 1(^3) = J/i 4/34*4 

in the first integral and 

9 2 3) = J* 724/34 ^ r 4 

in the second. Comparing these results to the exact expansion of g(r ) shows that g x (r) 
is correct and g 2 (r) is incorrect (as in all four equations). (Note how the graphs appear- 
ing in g 2 (r) above satisfy the recipe described earlier and how those that do not appear 
do not satisfy the PY recipe.) 

A look at the pressure equation 

shows that the yth virial coefficient is given by 

Bj{T) = - ru'(r)e- pu( %_ 2 (r, T)4nr 2 dr ( 1 3 - 90 ) 

Since g t (r) is given correctly in all four equations, all four equations yield the correct 
third virial coefficient. Since g 2 (r) is given incorrectly by all four integral equations, 
they all yield incorrect and different fourth virial coefficients. Also, because of the 
approximate nature of g(r ), the fourth and higher virial coefficients depend upon 
whether one uses the pressure equation [Eq. (13-23)] or the compressibility equation 
[Eq. (13-58)] to calculate them. This type of discrepancy has been used extensively to 
compare and assess these four equations.* The results of such calculations for hard 
spheres are shown in Table 13-2. 

Much work has been done in trying to devise schemes in which consistency of all 
of the equations up to the fifth virial coefficient is insured, but the fifth or at least 
higher virial coefficients remain both inconsistent and incorrect.! It is not clear that 

* G. Stell, J. Chem. Phys. y 36, p. 1817, 1962; G. S. Rushbrooke and P. Hutchinson, Physica, 27, p. 647, 
1961, and 29, p. 675, 1963; D. A. McQuarrie, /. Chem . Phys. 9 40, p. 3455, 1964. 

f G. H. A. Cole, Repts. Prog. Phys. y 31, p. 419, 1968. 


(1 3-88) 


(1 3-89) 
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Table 13-2. Virial coefficients for the rigid-sphere fluid from the several approximate theories* 



B 2 P 

Bi' 

B 3 P 

b 3 c 

BS 

BS 

B s p 

Bs c 

exact 

1 

1 

t 

i 

0.2869 

0.2869 

0.1103 

0.1103 

BGY 

1 

1 

* 

i 

0.2252 

0.3424 

0.0475 

0.1335 

K 

1 

1 

* 

i 

0.1400 

0.4418 



HNC 

1 

1 

f 

i 

0.4453 

0.2092 

0.1147 

0.0493 

PY 

1 

1 

f 

t 

0.2500 

0.2969 

0.0859 

0.121 


* These values are reduced by the hard-sphere second virial coefficient b D = 27ra 3 /3. 


such attempts are fruitful, and we quote Rice and Gray (in “Additional Reading”) to 
this effect: “It is important to emphasize that the failure of the theories to reproduce 
the virial coefficients does not necessarily imply that the theories are useless in the 
liquid region. In each case, contributions from all orders of the density are included 
in the integral equation representation. We may, therefore, expect the equation of 
state to be superior to a four- or five-term virial expansion. Indeed, if the contributions 
of each order of the density are those most important in the liquid range (highly con- 
nected diagrams), the approximate theories might be quite good at high densities even 
if the virial coefficients are not exact. It is, therefore, important to examine other 
predictions of the various theories before deciding on theii relative merits.” 


13-9 COMPARISONS OF THE INTEGRAL EQUATIONS 
TO EXPERIMENTAL DATA 

The four integral equations that we have been discussing are all nonlinear integral 
equations. The numerical solution of such equations is quite difficult, and it has 
been only in the past few years that such solutions have been produced. Almost all 
work has focused on two potentials, the hard-sphere potential and the 6-12 potential. 

It turns out that the PY equation can be solved analytically for hard spheres (al- 
though apparently not for disks).* The two equations of state corresponding to the 
pressure equation and compressibility equation are 


p l +2y + 3y 2 
pkT~ (1 -yf 


[from the pressure equation, Eq. (13-23)] 


(1 3-91 ) 


p 1 +y + y 2 
P kT~ (1 -yf 


[from the compressibility equation, Eq. (13-54)] (13-92) 


respectively, where y = npc 3 l6 — nv 0 /6v. Of course, these would be the same if 

the Percus-Yevick equation were exact. The other integral equations must be solved 
numerically. The results of these calculations are plotted in Fig. 13-6. It is not clear 
from the figure, but both the Kirkwood and the Born-Green- Yvon equations do not 
have solutions above certain densities, presumably indicating some kind of a phase 
transition. On the other hand, the Percus-Yevick equation allows the density to 
exceed the closest-packed volume since the equations of state [Eqs. (13-91) or (13-92)] 
predict finite pressures until y-+ 1. This corresponds to v = (>/27i/6)i? 0 , which is 
physically impossible since it is less than the closest-packed volume. Yet it does 
describe the equation of state best over the fluid region. 


* E. Thiele, J. Chem. Phys ., 39, p. 474, 1963; M. S. Wertheim, Phys. Rev . Lett. , 10, p. 321, 1963. 
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P 

pkT 




Figure 13-6. (a) Equation of state of hard spheres calculated from the Bom-Green-Yvon and Kirkwood 
integral equations compared with the results of molecular dynamics calculations. v 0 is the 
closest-packing volume, No 3 /V2 . (b) Equation of state of hard spheres calculated from 
the HNC and Percus-Yevick integral equations compared with the results of molecular 
dynamics calculations. (From D. Henderson, Ann . Rev . Phys. Chem ., 15, p. 31, 1964.) 


The pressure equation can be integrated analytically for hard spheres. Recall that 
the pressure equation is [Eq.( 13-23)] 


Now for a hard-sphere potential, u\r) is zero everywhere except at r = <x, where it is 
essentially a delta function, and so Eq. (13-23) becomes for hard spheres (Problem 
13-33) 


p 2na 3 

W " <+—«*«+> 


(13-93) 


where g(cr+) is the value of g(r) at o + e, where e -► 0 from the positive direction. 
Remember that for hard spheres g(r) is discontinuous at r = a, and so we must specify 
whether g(&) 9 the value of g(r) at contact, is evaluated from the right or from the left. 
Equation (13-93), in conjunction with any of the excellent hard-sphere equations of 
state such as the Ree-Hoover Pade approximant [Eq. (13-3)] or the Carnahan- 
Starling equation (Problem 12-34), gives g at contact. This quantity is sometimes 
useful in transport theories (Section 19-5). 

Remember that both the Kirkwood and BGY equations are derived through the 
superposition approximation, which states that 


g (3) (r t , r 2 , r 3 ) = g l2 \r l2 )g i2 \r i3 )g^\r 23 ) 


Alder,* using molecular dynamics, has made an interesting study of the adequacy of 


B. J. Alder, Phys. Rev. Lett., 12, p. 317, 1964. 
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1.0 1.5 2.0 


x 

(b) 

Figure 13-7. (a) Alder’s test of the superposition approximation for the dense rigid sphere fluid. (From 
B. J. Alder, Phys. Rev. Lett., 12, p. 317, 1964. (b) Rahman’s test of the superposition 
approximation for a Lennard- Jones 6-12 fluid. (From A. Rahman, Phys. Rev. Lett., 12, 
p. 575, 1964.) (From S. A. Rice and P. Gray, Statistical Mechanics of Simple Liquids . 
New York: Interscience, 1965). 

this approximation by computing the function g {2) and comparing to its superposition 
approximation. Alder shows that the superposition approximation may be satisfac- 
tory to within a few percent at high densities in spite of the fact that it is much worse at 
low densities (i.e., giving fourth virial coefficients that are incorrect by 20 percent or 
so). Figure 13-7(a) shows a typical result from Alder’s paper. This seems to indicate 
that the superposition approximation is capable of yielding satisfactory results at 
high densities. Rahman* has found similar results for the Lennard-Jones 6-12 poten- 
tial [Fig. 13-7(b)], although Wang and Krumhanslf have shown that the superposition 
approximation suffers under a more extensive study. 

In an interesting paper, Reiss J has derived the Kirkwood superposition approxima- 
tion by means of a variational principle. His method is applicable to higher-order 
distribution functions as well. 

Figure 13-8 shows the radial distribution function for hard spheres as calculated 

* A. Rahman, Phys. Rev. Lett., 12, p. 575, 1964. 
t S. Wang and J. A. Krumhansl, J. Chem. Phys., 56, p. 4287, 1972. 

X H. Reiss, J. Stat. Phys., 6, p. 39, 1972. 
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1.0 2.0 3.0 4.0 

r/a 

Figure 13—8. The radial distribution for a hard-sphere fluid as calculated from the Percus-Yevick equa- 
tion by Throop and Bearman. (From /. Chem. Phys., 42, p. 2408, 1965). The dash lines 
represent the function y(r) = exp(w(r))p(/*). 

from the Percus-Yevick equation.* * * § Although the equation of state can be written in a 
simple analytic form, namely, Eqs. (13-91) and (13-92), this is not so for the radial 
distribution function. Very accurate radial distribution functions for hard spheres 
have recently been tabulated by Barker and Henderson! by Monte Carlo methods,} 
and it is found that the Percus-Yevick results are quite similar to the accurate results. 
One important feature to notice in Fig. 13-8 is the discontinuity in g(r) at the hard- 
sphere diameter c. This is due to the factor of exp[— /? u(r)\ that occurs in g(r) [cf. 
Eq. (13-66)]. This is exactly why it is convenient to introduce the function y(r ) = 
exp[j Su(r)]g(r). This function is continuous and smooth at r = o since the factor that 
produces the discontinuity in g(r) has been taken out by multiplying by exp [pu(r)\ 
A plot of y HS (r) is given by the dashed lines in Fig. 13-8. It turns out that the form of 
y(r ) is fairly insensitive to the potential, so that a plot of y(r) versus r for a more realis- 
tic potential would appear much the same. 

Although the radial distribution function cannot be determined analytically, the 
direct correlation function can be and is given in papers by Thiele, § Wertheim,||, and 
Baxter.^ 

* G. J. Throop and R. J. Bearman, J. Chem . Phys., 42, p. 2408, 1965. 

t J. A. Barker and D. Henderson, Mol. Phys., 21, p. 187, 1971. 

X The Monte Carlo method is a numerical technique to evaluate multifold integrals. The application of this 
method to the evaluation of the configuration integral of a system involving a few hundred particles was 
pioneered by Metropolis et a! . (J. Chem.Phys 21, p. 1087, 1953) and well reviewed by Wood in Temperley, 
Rowlinson, and Rushbrooke (in “Additional Reading ’*). 

§ E. Thiele, J. Chem. Phys., 39, p. 474, 1963. 

|| M. S. Wertheim, Phys. Rev. Lett., 10, p. 321, 1963. 

^ R. J. Baxter, Phys. Rev., 154, p. 170, 1968. 
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Table 13-3. Comparative values of the compressibility factor z and the entropy AS = — {S — S°)/R for 
a hard-sphere fluid 41 


density 

Kirkwood 

BGY 

PY(c) 

PY(p) 

Pad6 (3, 3) 

CS 

y 

v/Vo 

z 

AS 

z 

AS 

z 

AS 

z 

AS 

z 

AS 

z 

AS 

0.0884 

8.38 

1.44 

0.03 

1.44 

0.03 

1.447 

0.028 

1.444 

0.029 

1.446 

0.028 

1.446 

0.028 

0.1562 

4.74 

1.91 

0.12 

1.93 

0.11 

1.965 

0.101 

1.946 

0.105 

1.960 

0.102 

1.959 

0.102 

0.2128 

3.48 

2.39 

0.24 

2.46 

0.23 

2.579 

0.212 

2.520 

0.219 

2.563 

0.214 

2.559 

0.215 

0.2616 

2.83 

2.89 

0.39 

3.04 

0.37 

3.304 

0.360 

3.170 

0.365 

3.266 

0.362 

3.260 

0.361 

0.3060 

2.42 

3.40 

0.55 

3.65 

0.56 

4.187 

0.548 

3.930 

0.546 

4.110 

0.548 

4.101 

0.547 

0.3444 

2.15 

3.91 

0.73 

4.21 

0.76 

5.192 

0.765 

4.757 

0.747 

5.057 

0.761 

5.047 

0.759 

0.3817 

1.94 

4.43 

0.92 

4.75 

1.00 

6.461 

1.04 

5.757 

0.992 

6.234 

1.03 

6.226 

1.00 

0.4150 

1.78 

4.93 

1.14 

5.33 

1.23 

7.978 

1.36 

6.857 

1.25 

7.619 

1.33 

7.617 

1.33 

0.4515 

1.64 

5.44 

1.37 

5.96 

1.49 

10.03 

1.78 

8.358 

1.61 

9.464 

1.73 

9.474 

1.72 

0.4840 

1.53 

5.95 

1.60 

6.54 

1.77 

12.50 

2.27 

10.031 

2.00 

11.64 

2.18 

11.68 

2.17 

0.5142 

1.44 

6.46 

1.84 



15.52 

2.84 

11.596 

2.46 

14.26 

2.70 

14.33 

2.69 

0.5405 

1.37 

6.99 

2.07 



18.89 

3.44 

14.007 

2.86 

17.14 

3.24 

17.26 

3.24 

0.5700 

1.30 

7.50 

2.32 



23.76 

4.27 

16.845 

3.87 

21.24 

3.98 

21.44 

3.98 

0.5972 

1.24 

7.93 

2.60 



29.89 

5.26 

20.119 

4.07 

26.32 

4.83 

26.63 

4.85 


♦ S° is the entropy of an ideal gas at the same pressure and temperature. 
Source: N. F. Carnahan and K. E. Starling,/. Chem. Phys., 53, p. 600, 1970. 


Table 13-3 compares the values of the compressibility factor z =pvjkT and the 
so-called excess entropy calculated by several methods for a hard-sphere fluid. The 
excess entropy is the difference between the entropy of the system of interest and that 
of an ideal gas at the same pressure and temperature. The excess entropy can be 
calculated in terms of the compressibility factor by means of the thermodynamic 
relation for hard spheres (Problem 13-35): 



f y (* ~ 0 
Jo y 


dy 


(1 3-94) 


where y = 7ra 3 p/6. Since there are two equations of state obtained from the Percus- 
Yevick equation (the pressure equation and the compressibility equation), there are 
two entries for this equation in Table 13-3. The column labeled CS is calculated using 
an equation of state derived by Carnahan and Starling* (Problem 12-34). It can be 
seen from this table that the Percus-Yevick equation gives the best agreement with 
the values calculated from the Pade approximant equation of state* of Hoover and 
Reef or the CS equation of state. 

Before we conclude this discussion of a hard-sphere fluid, we point out an interest- 
ing paper by Rice and Lekner} in which they improve the superposition approxima- 
tion and then use this improved version to truncate the Born-Green-Yvon hierarchy. 
The exact formal expression for the triplet correlation function is given by Eq. (13-75) 
namely, 

0 <3) (1, 2, 3) = exp [— /3 (k 12 + w 13 + w 2 3 )l[l + f V^ 1 . 2 > 3 )] 


Rice and Lekner evaluate the first few terms in the above summation over n and then 
use these to construct a Pade approximant for the summation. The BGY equation 
was then solved using this approximation for g i3 \ and the pressure obtained from the 


* N. F. Carnahan and K. E. Starling, /. Chem. Phys., 51, p. 635, 1969. 
t F. H. Ree and W. G. Hoover, /. Chem . Phys., 40, p. 939, 1964. 

X S. A. Rice and J. Lekner, J. Chem. Phys., 42, p. 3559, 1965. 
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Figure 13-9. The equation of state of the BGY equation for hard spheres with an improved version of 
the superposition approximation compared to several other hard-sphere equations of state. 
(From S. A. Rice and J. Lekner, J. Chem. Phys ., 42, p. 3559, 1965.) 

contact value of g {2) is in almost perfect agreement with the molecular dynamics 
equations of state data of Alder and Wainwright up to p/pkT = 6.80, which is the 
Kirkwood upper limit of the stability of a fluid of hard spheres, i.e., the point beyond 
which no fluid-type solution exists to the BGY equation. Figure 13-9 compares the 
equation of state obtained in this way with several other approximate theories. The 
improvement over the ordinary BGY solution is seen to be quite substantial. 

A comparison of the various integral equations for a more realistic potential such 
as the Lennard-Jones potential is more difficult since in this case temperature enters 
in as another variable. All four of these equations have been solved for the 6-12 
potential. 

Figure 13-10 shows the compressibility and virial equations of state for the Lennard- 
Jones 6-12 potential with T* = 2.74. It can be seen from this that the Percus-Yevick 
equation is the most satisfactory at this reduced temperature. Figure 13-11 shows the 
radial distribution function from the Percus-Yevick equation for the Lennard-Jones 
6-12 potential. Again it can be seen that the comparison to molecular dynamics 
results is quite good. This agreement deteriorates as the temperature gets lower and 
the density gets higher. 

Figure 13-12 shows the Percus-Yevick equation of state for a 6-12 fluid. This 
figure has an entry that we have not yet mentioned. It has been shown by Barker and 
Henderson and colleagues* that one can obtain good results by integrating the 
energy with respect to the temperature to obtain the Helmholtz free energy and then 
differentiating with respect to the density to obtain the pressure. The energy equation 
appears to be less sensitive to errors in g(r), and this is seen in Fig. 13-12. 

Table 13-4 compares the compressibility factor and the average potential energy 
from the Percus-Yevick, hypemetted-chain, and Born-Green-Yvon equations to 
molecular dynamics calculations. The calculations were done at a temperature well 


♦ J. A. Barker, D. Henderson, and R. O. Watts, Phys. Lett., A31, p. 48, 1970; M. Chen, D. Henderson,and 
J. A. Barker, Can.J. Phys., 47, p. 2009, 1969; D. Henderson and M. Chen, Can.J. Phys., 48, p. 634, 1970. 



COMPARISONS OF THE INTEGRAL EQUATIONS TO EXPERIMENTAL DATA 285 



Figure 13-10. The compressibility and virial equations of state for the Lennard-Jones 6-12 potential. 

The BGY, HNC, and PY results are those of D. Levesque ( Physica , 32, p. 1985, 1966) and 
A. A. Broyles et al. (J. Chem. Phys ., 37, p. 2462, 1962) and the Monte Carlo results are 
due to Wood and Parker (J. Chem. Phys., 27, p. 720, 1957). 


Table 13-4. Comparison of thermodynamic functions from the BGY, HNC, PY, and molecular dyna- 
mics calculations at T* — 2.74 


po z 

molecular 

dynamics 

(p/pkTY 

PY 

HNC 

BGY 

0.400 

1.2-1 .5 

1.24 

1.28 

1.26 

0.833 

4.01 

4.01 

5.11 

2.3 

1.000 

7.0 

6.8 

9.1 

3.1 

1.111 

7.8 

9.2 

13.2 

3.8 


molecular 

O/NkT 




dynamics 

PY 

HNC 

BGY 


-0.86 

-0.865 

-0.859 

-0.85 

0.833 

-1.58 

-1.61 

-1.40 

-1.8 


-1.60 

-1.67 

-1.19 

-2.2 

1.111 

-1.90 

-1.59 

-0.78 

-2.6 


Source: S. A. Rice and P. Gray, The Statistical Mechanics of Simple Liquids (New York: Interscience, 
1965). 








Figure 13-12. The Percus-Yevick equation of state for a 6-12 fluid. The three curves marked P, C, and E 
are the Percus-Yevick results from the pressure, compressibility, and energy equations, 
respectively. (From J. A. Barker and D. Henderson, Ann. Rev. Phys. Chem., 23, p. 439 
1972.) 
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above the critical temperature (T* » 1.29). It can be seen that the agreement is 
quite good at the lower densities, but becomes poorer at the higher densities. The 
Percus-Yevick equation gives the best agreement, however. 

Table 13-5 shows a more extensive comparison for the Percus-Yevick equation. 
Again the agreement, in this case with actual experimental data, is quite good. Table 
13-6 shows that such agreement is not found for the pressure, at least, if the Percus- 
Yevick equation is pushed to very low temperatures and high densities. Nevertheless, 
the Percus-Yevick equation seems to be a satisfactory equation up to times the 
critical density and near the critical temperatures. 

Table 13-5. A comparison of the pressure, the internal energy, and the excess entropy as calculated from 
the Percus-Yevick equation with experimental data for fluid argon 

Experimental and theoretical argon pressure, p*lkT* 

T*- 1.3 T* - 1.4 T* ~ 1.5 T* - 2.0 


p* 

Exp. 

Calc. 

Exp. 

Calc. 

Exp. 

Calc. 

Exp. 

Calc. 


0.070 

0.071 

0.075 

0.075 

0.078 

0.079 

0.089 

0.090 

0.2 

0.097 

wmm 

0.113 

0.114 

0.125 

0.127 

0.167 

0.169 

0.3 


EH 

0.136 

0.137 

0.161 

0.164 

0.250 

0.254 

0.4 

0.115 

0.118 

0.164 


0.207 

0.215 

0.362 

0.369 

0.5 

0.153 

0.172 

0.233 

0.249 

0.303 

0.317 

0.546 

0.552 

0.6 

0.311 

0.326 

0.431 

0.437 

0.535 

0.532 

0.881 

0.865 


Experimental and calculated internal energy, E* 


P* 

r*- 

1.3 

T* = 

1.4 

7+ =1.5 

T * 

2.0 

Exp. 

Calc. 

Exp. 

Calc. 

Exp. Calc. 

Exp. 

Calc. 

0.1 

-0.825 

-0.781 

-0.785 

0.748 

-0.755 -0.723 

0.665 

-0.647 

0.2 

-1.581 

-1.534 

-1.499 

-1.451 

-1.442 -1.400 

-1.287 

1.266 

0.3 

-2.233 

-2.181 

2.129 

-2.081 

-2.063 -2.024 

-1.879 

-1.865 

jgm 

-2.785 

-2.734 

-2.705 

-2.640 

-2.647 -2.622 

2.457 

-2.455 


-3.337 

-3.322 

-3.278 

-3.271 

-3.229 -3.226 

-3.030 

-3.042 


-3.934 

-3.938 

-3.876 

-3.885 

-3.826 -3.835 

-3.593 

-3.613 


Experimental and calculated excess entropy, S E /Nk 


P* 

T* = 

1.3 

T* = 

1.4 

T* = 

1.5 

r* = 

2.0 

Exp. 

Calc. 

Exp. 

Calc. 

Exp. 

Calc. 

Exp. 

Calc. 

0.1 

-0.317 

-0.298 

-0.288 

-0.274 

-0.266 

-0.256 

-0.213 

0.212 

0.2 

-0.621 

-0.604 

-0.560 

0.544 

-0.521 

-0.510 

0.429 

-0.431 

0.3 

-0.887 

-0.866 

-0.810 

-0.795 

-0.764 

0.757 

-0.656 

-0.665 

0.4 

-1.116 

-1.096 

-1.056 

-1.053 

-1.015 

-1.022 

-0.904 


0.5 

-1.380 

-1.396 

-1.336 

-1.362 

-1.302 

-1.331 

1.188 


0.6 

-1.728 

-1.772 

-1.684 

-1.733 

-1.649 

1.694 

-1.516 

-1.556 


Source: R. O. Watts, J. Chem. Phys ., 50, p. 984, 1969. 


Table 13-6. Comparison of the Percus-Yevick and molecular dynamics thermodynamic properties for a 
Lennard-Jones fluid 


pressure (fiP/p) energy (fiE/N) 


T* 


Percus- 

Yevick^) 

molecular 

dynamics 

Percus- 

Yevick^) 

molecular 

dynamics 

Percus- 

Yevick 

1.128 

0.85 

1.71 

2.78 

3.57 

-5.05 

-4.98 


0.85 

0.54 

1.64 

3.17 

-6.75 

-6.61 

0.786 


(-0.10) 


2.97 


-7.51 

0.719 




2.82 

-8.51 

-8.28 


Source: F. Mandel, R. J. Bearman, and M. Y. Bearman, J. Chem. Phys., 52, p. 3315, 1970. 
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Figure 13-13. (a) The Percus-Yevick equation of state for the Lennard-Jones 6-12 potential derived 
from the compressibility equation for several isotherms. The curves are (1) T* 1.2; (2) 
T* = 1.263; (3) T* - 1.275; (4) T* — 1.3. --- indicates the region where the P-Y equation 
has no solutions, (b) The Percus-Yevick equation of state calculated from the pressure 
equation. (From R. O. Watts, J. Chem. Phys ., 48, p. 50, 1968.) 


Table 13-7. The reduced critical parameters obtained from various theories of the liquid state* 



T* 

Pc* 

Pc*tPc*T* 

reference 

experimental 

1.28 

0.66 

0.29 


3-term virial series 

1.45 

0.75 

0.34 

(a) 

4-term virial series 

1.30 

0.56 

0.35 

(a) 

BGY(p) 

1.68 

1.08 

0.36 

(b) 

HNC(p) 

1.25 

0.55 

0.30 

(c) 

HNC(c) 

1.39 

0.59 

0.38 

(c) 

PY(p) 

1.25 

0.61 

0.30 

(c) 

PY(c) 

1.32 

0.59 

0.36 

(c) 

van der Waals 

0.30 

0.24 

0.38 

(d) 

Lennard-Jones Devonshire 

1.30 

1.19 

0.59 

(d) 


* The temperature is reduced by ejk, and the density is reduced by 2nNa s f3. 


Sources: (a) J. A. Barker and J. J. Monaghan, J. Chem. Phys., 36, p. 2564, 1962; (b) J. G. Kirkwood, V. A. 
Lewison, and B. J. Alder, /. Chem. Phys., 20, p. 929, 1952; (c) L. Verlet and D. Levesque, Physica, 36 
p. 254, 1967; (d) Chapter 12. 


Figure 13-13 displays a set of p-V isotherms for the PY equation and 6-12 poten- 
tial for both the pressure and compressibility equations. It is interesting to see that 
below T* = 1.275 no solutions were found, possibly indicating a phase transition. 

Table 13-7 lists the critical constants determined from a number of equations of 
state. The van der Waals equation will be discussed in the next chapter. 

Lastly, we mention that the most reliable calculations of the radial distribution are 
molecular dynamics or Monte Carlo calculations. One advantage of this method is 
that the intermolecular potential is known, and so these calculations serve as “experi- 
mental” data on which to test various theories. The molecular dynamics radial dis- 
tribution function for hard spheres was first given by Alder and Hecht* and Barker, 


* B. J. Alder and C. E. Hecht, J. Chem. Phys., 50, p. 2032, 1969. 
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Watts, and Henderson;* * * § the radial distribution function for the Lennard-Jones 
potential has been calculated by Verlet;t and the radial distribution function for an 
inverse 12 potential has been calculated by Hansen and Weis.J A program that 
generates the hard-sphere radial distribution function in excellent agreement with 
the molecular dynamics values is given in Appendix D. This program is due to 
Grundke and Henderson. 

In addition. Barker, Fisher, and Watts § have presented extensive molecular 
dynamics and Monte Carlo calculations, including three-body forces. Molecular 
dynamics and Monte Carlo calculations are particularly useful in the perturbation 
theories to be studied in the next chapter and in the transport theories to be studied in 
later chapters. 
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PROBLEMS 

13-1. Construct a (2, 2) Pade approximant for the pressure of a hard-sphere system and 
compare to the molecular dynamics values. Use the Appendix of the paper by Ree and Hoover |] 
to construct the Pade approximant and the paper of Carnahan and Starling^! for the molecular 
dynamics values as well as some other theoretical expressions. 

13-2. Derive an equation for the thermodynamic energy E in terms of g i2y and g i3y for an 
intermolecular potential that can be written in the form 

U(r t , r 2 , .... r*) = 2 « <2> 0u) + 2 « (3) 0u , O* > f lk ) 

KJ l<J<k 

13-3. Use the pressure equation to derive an equation for the Helmholtz free energy in 
terms of g(r, p, T). 

* J. A. Barker, R. O. Watts, and D. Henderson, Mol. Phys., 21, p. 187, 1971. 

t L. Verlet, Phys. Rev., 165, 201, 1968. 

t J. P. Hansen and J. J. Weis, Mol. Phys., 23, p. 853, 1972. 

§ J. A. Barker, R. A. Fisher, and R. O. Watts, Mol. Phys., 21, p. 657, 1971. 

|| J. Chem. Phys., 40, p. 939, 1964. 

1 /. Chem. Phys., 51, p. 635, 1969. 
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13-4. Show that the fugacity is related to the intermolecular potential by 

In /= \n(pkT) + -£=, f f u{r)g{r, p, £)4? n 2 dr d£ 

fCl J Q Jq 

Using this formula, derive a virial expansion for /. 

13-5. Show that the last integral in Eq. (13-13) vanishes for s ^ 0. 

13-6. Derive an equation for the Helmholtz free energy in terms of g(r, T) by using the 
Gibbs-Helmholtz equation [Eq. (13-27)]. Use this to derive an equation for the chemical 
potential. 

13-7. Show that V n N\:N%N — n)l can be written as V"\\ -f 0(N ~ *)]. This factor occurs 
in Eq. (13-9) for ^ <n) (ri , . . . , r„). Show that this implies that # (2) -* 1 4- O (N ~ *) as r -»■ oo. 

13-8. Derive Eq. (13-62). 

13-9. Define the potential of mean force in words. What does it have to do with the super- 
position approximation? 

13-10. Show that the expression for the chemical potential as an integral over a coupling 
parameter is actually equivalent to the use of the Gibbs-Helmholtz equation and an integra- 
tion with respect to 1 T. 

13-11. Derive a superposition approximation for # (4) by assuming that the potential of 
mean force w (4) is pair-wise additive. 

13-12. Derive the second member of the Kirkwood hierarchy; i.e., using the concept of a 
coupling parameter, derive an integral equation for g i3) in terms of g w . What superposition 
approximation would you make on # (4) ?* 

13-13. Derive the Born-Green-Yvon integral equation [Eq. (13-47)] by taking the gra- 
dient with respect to the position of particle 1 in Eq. (13-9) for p (n) and then manipulating 
much in the same manner as in the derivation of the Kirkwood equation.f 

13-14. If Jf(k) is the Fourier transform of /z(r 12 ) and C(k) is the Fourier transform of 
c(r 12 ), then show that 

C(k)_ l+P^(k) 

by taking the Fourier transform of the Omstein-Zernike equation. 

13-15. Show that 

P w (.r t2 ) = e~ mr ' 2 ' jp 2 + p 3 

13-16. Derive Eq. (13-68). 

13-17. Show that g^r, T) [Eq. (13-69)] is given by 
0i(r, T) = 0 0< r< o' 

(4 r 1 r 3 \ 

7to , 3 {— h — — > o<r<2o 

(3 (7 12 cr 3 J 

= 0 r>2o 

13-18. Show that the density derivative of p n g in) is given by 

djpW'y 1 np" ~ 1 p ( " ) + p" J te 0 ’* 1) (fi , . . . , r.+ 0 — , . . . , r„)} dr* 

ty J, _ 1 +pj{p(r)- 1}* 

* See H. Reiss, J. Stat. Phys., 6, p. 39, 1972, for an excellent discussion of the superposition approximation 
applied to g(+K 

t See T. L. Hill, Statistical Mechanics , Section 33, in “Additional Reading. ” 
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Hint: Start with Eq. (13-49) for p (B) and differentiate this with respect to N at constant 
volume, using the key fact that the dependence of p (n) and N on density at constant tempera- 
ture and volume is only in Z. Thus * * * § we can use 



13-19. Show that the temperature derivative of g(r) involves and g<*\ f 

13-20. Derive an expression for the first two terms in the density expansion of the struc- 
ture factor for a system of hard spheres. 

13-21. For a nonadditive intermolecular potential, y(r) has the density expansion 

y(r) = 1 4- yi iA \r)p + yi iNA \r)p H 

where yi iA) (r) and denote the additive and nonadditive contributions to yi(r\ 

respectively. Show that 

yi iA \r)= J A dr 3 
and that]: 

yi iNA \r) = J exp[— jS(« 23 -f «i 3 )][exp( fans) — 1] dr 3 

13-22. Solve the Ornstein-Zernike equation for c(r) by iteration to derive Eq. (13-72). 

13-23. Substitute Eq. (13-74) into Eq. (13-58) to derive the virial expansion for the 
pressure. 

13-24. Derive Eq. (13-75) for g< 3 \ 

13-25. Substitute Eq. (13-66) into Eq. (13-23) to derive the virial expansion for the 
pressure. 

13-26. Show that the second term in the direct correlation function of the hypernetted- 
chain approximation [y(r ) — 1 — In y(r)] is small as r increases beyond the first maximum in 
g(r), say. 

13-27. Show that y(r) from the Percus-Yevick equation is given by Eq. (13-86). 

13-28. Show that all four of the integral equations discussed in this chapter yield the 
correct third virial coefficient. 

13-29. Show that the hypernetted-chain equation leads to a density expansion for y(r) 
in the form 

y(r) = 1 4 y^p + y 2 (r)p 2 + • • • 
where§ 

J'i(r) jAdr 3 

yz(r) JJ* [i H -h FI +2 FI] dr 3 dr* 

13-30. Make the simple approximation that c(r) = f(r) and derive an integral equation 
for h(r) by substituting this into the Ornstein-Zernike equation. Solve this equation by itera- 
tion and derive a density expansion for h(r). Can you identify the general term graphically? 
Solve this equation for the Fourier transform of h(r ). Can you invert this? Does it give a 
correct third virial coefficient? Calculate the structure function h(k) and plot this versus ko . 


* See P. Schofield, Proc. Phys. Soc 88, p. 149, 1968. 

t See P. Schofield, Proc . Phys . Soc., 88, p. 149, 1968. 

t See C. T. Chen and R. D. Present, J. Chem. Phys., 53, p. 1585, 1970. 

§ See D. A. McQuarrie, J. Chem. Phys., 40, p. 3455, 1974. 
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13-31. Invent a reasonable approximate form for c(r ) and derive your own integral 
equation for h(r). Does it reproduce any virial coefficients? Can the equation be solved by 
iteration to derive a density expansion for h(r)l 

13-32. Prove generally that the superposition approximation yields the correct third 
virial coefficient. 

13-33. Show that the equation of state of hard spheres can be written 


p 2 ttdo 3 

W T~ ' + — <*>+> 

where g(o +) is the value of g(r) ata-fsasc-^O from the positive direction.* 
13-34. The entropy per molecule of a monatomic ideal gas is given by 

Ts = kT\n v - kT\n A 3 + §kT 

Define an excess entropy per molecule s v E at any volume v by 
s = k In v — k In A 3 -}- §k 4- s v E 
Show that 


by integrating the thermodynamic equation 
dv T constant 


Ml) 


between v and v° where v° ->■ oo.f 

13-35. Define excess thermodynamic quantities at any pressure p by 


kT\np + kT\n\ — \ + *x P 


(S) 


E=iNkT+E P ' 


(I) 


S — —Nk In p — Nk ln( — J 4- §Nk 4- S t 

First derive the equation 

TS P E = E P E - Nfi p E 4- (pV- NkT) 

Using the fact that E E = 0 for hard spheres and that 

r v ° 

A°-A(V)= - pdV=(NfjL 0 -NkT)~(Np-pV) 

Jy 

derive the equations 


kT 




and 




* See T. L. Hill, Statistical Mechanics, Section 35, in “Additional Reading.'* 

f See T. L. Hill, Statistical Mechanics , Section 36, in “Additional Reading, *’ for an application of this 
equation. 
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13-36. Using the Camahan-Starling equation of state (Problem 12-34) 

l + y + y 2 — y* 

P kT~ (1 -yY 

where y = 7rp(j 3 /6, derive analytical expressions for the excess entropy, excess energy, excess 
Helmholtz free energy, excess enthalpy, and excess Gibbs free energy for a hard-sphere 
fluid.* 

13-37. We shall see in the next chapter that one can obtain an excellent approximation 
to the correlation function y(r) associated with the intermolecular potential u(r) by writing 

y(r) e- puir) y d (r) 

where y d {r) is the correlation function of a system of hard spheres of diameter d. In the next 
chapter we shall give a prescription for the choice of the value of d, but for now simply test 
this approximation for an inverse-12 potentialf u(r) = 4e(cr/>) 12 and the Lennard-Jones 6-12 
potential} by varying d around a. For the hard sphere y(r), use either the molecular dynamics 
data of Alder and Hecht§ or the program listed in Appendix D. 

13-38. Derive expressions for the pressure and thermodynamic energy of a multicompo- 
nent system, namely, 

w~ l ~ 2 i^l x,Xi 1 9ij(r)u ‘ iXr)r3 dr 

E = §NkT -|- lirNp 2 f 9 u(r)ui^r)r 2 dr 

i.J J o 

where x t is the mole fraction of component /. 

13-39. The generalization of the Omstein-Zemike equation for mixtures is 

htj(r 12) = C|j(r 1 2) -f- p 2 Xi J* hn( r i 3 ) c u(. r 23) dr 3 

Use this to derive the Percus-Yevick equation for a multicomponent system. This equation 
has been solved analytically for additive hard spheres by Lebowitz! and Baxter.^ Lebo- 
witz’s paper gives an analytical expression for the Laplace transform of rg tJ (r). 

13-40. A difficulty with the radial distribution function approach to solutions is that the 
radial distribution function of a mixture depends upon so many variables, such as the densi- 
ties of each component, the temperature, and the intermolecular potential parameters e i3 
and a ij9 say for a Lennard-Jones potential. There is a set of approximate mixture theories 
that attempts to treat the solution as an ideal mixture of pure fluids with an effective inter- 
molecular potential. These theories are called / 2 -fluid theories, the simplest of which is the 
random mixture theory . In this theory, we start with 

E= %NkT+2nNp 2 XiXj f Ui/r )g tJ (r )r 2 dr 

i.J J 0 

and let 

9n(r) = gn(r) = g 12 (r) = • • • = gir) 

* See Problems 13-34, 13-35 and N. F. Carnahan and K. E. Starling, J. Chem. Phys. 9 53, p. 600, 1970, for 
the definition of excess thermodynamic quantities, 
t J. P. Hansen and J. J. Weis, Mol . Phys. 9 23, p. 853, 1972. 

} L. Verlet, Phys. Rev. t 165, p. 201, 1968. 

§/. Chem . Phys.> 50, p. 2032, 1969. 

|| Phys. Rev., 133, p. A895, 1964. 

H/. Chem. Phys., 52, p. 4559, 1970. 
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to get 


E = iNkT+lirNp f u(r)g(r)r 2 dr 

» n 


where 


«(r)= 2 xtXjUtjir) 

l,J 

Thus we see that the random mixture theory treats the mixture as a single pure fluid with an 
effective intermolecular potential given above. Hence it is often called a one-fluid theory . 
Show that if u l3 (r) = 4£ij{(pij/r) 12 — ( o i3 lr ) 6 }, then the random mixture theory uses an ef- 
fective Lennard- Jones potential with 

CLi.jXtXjeijOtf) 2 

XiXjEijOij 12 

f I<.jXiX J e lJ o lJ ' 2 y /6 

* \ '2j.jXiX J e t ja t j e I 

13-41. A more sophisticated approximation than the random mixture theory is the 
average potential model , in which one assumes that 

gu(r) = M^n(r) + ^ 22 (r)] 

Show that is the case that the effective intermolecular potential is 
Ut(r) = x t uu{r) + xj u i3 (r ) 

which for a Lennard-Jones potential gives potential parameters 

_ (2 J x J £ iJ a u 6 ) 2 
IjXjBu o tJ 12 

2 jX 3 £ i3 o l3 € J 

Thus a two-component solution is treated as a mixture of two pure pseudocomponents, and 
hence this approximation is often called a two-fluid theory . 

13-42. If we assume that the intermolecular potential is of the form 
«v(r) = *ijF 

where F is some universal function, then the energy can be written as 
E = %NkT + 2irNp 2 XiXjEtj f Ft- — ) g i3 (r)r 2 dr 

l.J J o \<W 

Now the g i3 (r) are functions that scale as 
9u = Qu 





Show that 


E - iNkT+lirNp 2 XiX 3 E i3 o i3 3 

t.j 


J n 
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0.5 1.0 1.5 

r l°22 

Figure 13-14. Typical radial distribution functions of a two-component mixture. 

Recall that the g tJ are functions of the x ly p, o 22 faii> kT/eu as well as r iJ * = rla lJ . Some 
typical values of g iS are shown in Fig. 13-14. From this figure we can see that the approxi- 
mations made in the random mixture theory and the average potential model are quite poor. 
A much better approximation is to let 

01 l(Tl 1 *) = 012 ^ 1 2 *) = 022 ^ 22 *) = 0&*) 

Show that this leads to 

£= %NkT-\- IttNp 2 XiXjEijOu 3 f F(x)g(x)x 2 dx 

i,J J o 

This equation is of the one-fluid form if we interpret g(r) as the distribution function corre- 
sponding to a system in which 

ea*= 2 XiXjEtjoij* 
i.j 

This is called the van der Waals 1 theory and is discussed in some detail by Henderson. 

13-43. In Problem 13-42 we developed the so-called van der Waals 1 theory. We can 
derive a van der Waals 2 theory by assuming that 

0 ii(ri 2 *) = il 0 ii(ru*) + 022 (r 22 *)] 

Show that this leads to 

E = iNkT+ 2t tN P 2 XiEtoS f F(r lt *)g lt (ri t*)r* 2 dr u * 

i J o 

where 

El a 1 3 = XiEu < 7|| 3 + XjElj Oij 3 

Figures 13-15 through 13-17 show some excess thermodynamic properties for the van der 
Waals 1 theory. Excess thermodynamic functions are defined as the difference between the 
thermodynamic function of mixing and the value corresponding to an ideal mixture at the 
same temperature, pressure, and composition. 
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Figure 1 3-15. Excess free energy of mixing as a function of o 22 loi 2 for an equimolar mixture at 97°K and 
p = 0 (e 12 /k = 133.5°K, a 12 = 3.596 A. The points give Monte Carlo data of Singer 
and Singer (Mol. Phys., 24, p. 357, 1972), the dotted curve gives the van derWaalsl 
theory, and the solid curve gives the results of perturbation theory (see Chapter 14). 
The curves are labeled with appropriate values of £ 22/112 - (From E. W. Grundke, 
D. Henderson, J. A. Barker, and P. J. Leonard, Mol. Phys. 9 25, p. 883, 1973.) 



Figure 13-16. Excess heat of mixing as a function of 022/012 for an equimolar mixture at 97°K and 
p — 0 (e 12 /k — 133.5°K, o 12 = 3.596 A. The points and curves have the same interpre- 
tation as in Figure 13-15. 
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Figure 13-17. Excess volume of mixing as a function of a 22 /o-i 2 for an eqnimolar mixture at 9TK and 
p = 0 (£i 2 /& = 133.5°K, a 12 = 3.596 A. The points and curves have the same interpre- 
tation as in Figure 13-15. 


13-44. Show that the Camahan-Starling hard-sphere equation of state (Problem 12-34) 
can be written in the form 

Zcs — i(2Zpy C -f- Z P Y P ) 

where Z is the compressibility factor p/pkT, and Z PY C and Z PY P are given by Eqs. (13-92) 
and (13-91), respectively. Mansoori et al* have used this observation to derive the corre- 
sponding result for mixtures. Using the equations of state from the Percus-Yevick equation 
for a mixture of hard spheres,t derive an equation of state for a mixture of hard spheres. 
Mansoori et al . have used this equation of state to calculate the thermodynamic properties 
of a mixture of hard spheres. 

13-45. Molecular dynamics calculations have proved to be quite useful in generating 
“experimental data” to which one can unambiguously compare various theories. The data 
are the results of solving the equations of motion of N particles under a specified known 
potential. The observable macroscopic properties are then calculated by time averaging the 
appropriate microscopic quantities. Write down the equations of motion of a small system, 
say five particles, under a hard-sphere potential and indicate how the observed macroscopic 
properties, such as the energy or the pressure, would be calculated. 

13-46. Using Eq. (13-74), show that for hard spheres the coefficient of p is the linear 
term in the density and is of the form of a cubic polynomial in r with the quadratic term 
missing. It so happens that this same functional form is found in the p 2 term as well. This 
observation played a central role in Wertheim’s analytic solution of the Percus-Yevick equa- 
tion for hard spheres.t 

*/. Chem. Phys., 54, p. 1523, 1971. 
t J. L. Lebowitz, Phys. Rev., 133, p. A895, 1964. 
t (See Phys. Rev . Lett . , 10, p. 231, 1963.) 
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Figure 13-18. The geometry used to calculate the intensity of radiation scattered through an angle#. 

The plane A represents the incident radiation, and B represents the plane of an observer. 


13-47. Consider the light scattered from a system of N scatterers. Let the incident 
oscillating electric field be propagated in the direction of the unit vector u 0 and let the 
scattered field be in the direction of the unit vector u. Figure 13-18 shows these unit 
vectors and the position of the /th scatterer with respect to some arbitrary origin. The 
location of this origin can have no effect on the final result. Let the incident radiation be 
assumed to have the same phase over the plane A , and let plane B represent the position of 
an observer of the scattered radiation. The radiation arriving at B will, in general, no longer 
be in phase because of the extra distance a H- b traveled by the upper beam in the figure. 
Hence the intensity of the radiation arriving at B will be less than the incident intensity. 
Define a quantity P(8) by 


^ scattered intensity from a collection of scatters 

P(8) = : 

initial intensity 

Note that as 8 -^0 in Fig. 13-18, the distance a + b-> 0, and hence the intensity of scattered 
radiation equals the intensity of incident radiation. Thus 


intensity of radiation scattered through an angle 8 
intensity of radiation scattered through an angle 8 0 

m 

m 


(13-95) 


where 1(8) is the intensity of radiation scattered through #. The quantity P(8), which can be 
determined experimentally, yields much information concerning the configuration of the 
collection of scatterers. First show that the extra distance traveled by the radiation scattered 
by the /th element can be written as 


a + 6 = rj-(u 0 — u) 


Now let n be a unit vector in the direction of u 0 — u and show that the vector u 0 — u equals 
2n sin 8 / 2 and 

8 

a + b = 2tj • n sin - 
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The phase difference <f> 3 due to this extra difference is (< a + b) ; A, where A is the wavelength 
of the radiation, and so 



The electric field at B due to the scattering by the element at j is 
S j — A cos[27r(v/ — <f> j)] 


(1 3—96) 


where v is the frequency of the radiation, and A can be considered to be simply a constant. 
The electric field from the collection of N scatterers is 


S s = 2 ^ cos[27 r(vt — </> j)] 
j = i 

The intensity is the energy that falls on 1 cm 1 2 of area per second and can be obtained by 
averaging S 2 over one period: 

f TdtS 2 r 1/v /" \ 2 

m= -JTdT = vA2 i cos “7 

where a 3 — 27r{vt — <j > 3 ). Show that this is equal to 

1(6) = 4- Z 2 cos Irt&t — 4>i> 

L 1 = 1 j= 1 

As 6 ->-0, each <f> t ->-0, and so 1(6) -+A 2 N 2 I2. The ratio of 1(6) to 1(0) then is 
P(6) = -^11 cos[2jt(^, — <f>j)] 

Jy i=i j= i 


= rji 2 2 cos[«n • (r, - r,)] 
TV i = i j= i 


(1 3-97) 


where 


477 . 8 

s = sin - (13-98) 

Note that only the difference in the positions of particles appears here and that the arbi- 
trarily placed origin in Fig. 13-18 drops out of the equations. 

Equation (13-97) is for a collection of scatterers in a fixed orientation with respect to the 
incident radiation. We must now average Eq. (13-97) over all orientations. This can be done 
by averaging over all orientations of n. To do this, take a spherical coordinate system with 
the 2 -axis to be along r, — r 3 with the angle between n and this 2 -axis denoted by a and the 
other angle in the spherical coordinate system denoted by jS. Show then that dn = sin a dec dp 
and 


1 r 2n r" 

cosfyn • r<j] = — cos (sr tJ cos a)sin a dec df$ 

477 J o * O 

sin sr tJ 
sr i3 

The scattering function P(6) is given by 


Inn 

*«>=T5 2 2 

/V < = i j = i 


sin sri 3 
sr l3 


(1 3-99) 



CHAPTER 14 


PERTURBATION THEORIES 
OF LIQUIDS 


In the previous chapter we have studied the radial distribution function theory of 
liquids. This constitutes a rigorous statistical mechanical theory. Although we even- 
tually had to introduce some sort of approximations into the formalism, we at least 
started from general principles. Because of this, however, the results are not particu- 
larly simple to use in practical calculations. There is an interesting and important result 
from the developments in Chapter 13 that we shall exploit in this chapter. Much 
evidence points to the fact that the structure of a liquid is primarily determined by 
the short-range repulsive forces and that the relatively longer-range attractive part of 
the potential provides a net force that gives a somewhat uniform attractive potential. 
Thus, in a sense we picture the repulsive part of the potential as determining the 
structure of the liquid and the attractive part as holding the molecules together at 
some specified density. 

For example, except for the discontinuity at r = a, the radial distribution function 
of a hard-sphere fluid is fairly similar to that of a real fluid. In particular, if this dis- 
continuity is removed by multiplying g{r) by exp[w(r)] to give y(r), we find that this 
function is quite similar for hard spheres and real fluids. The function y(0 is shown in 
Fig. 13-8 for hard spheres, but a similar result would be found for a Lennard-Jones 
potential. 

Molecular dynamics calculations have also been used to show that the structure 
of a real fluid is similar to a hard-sphere fluid. Verlet* has carried out extensive 
molecular dynamics calculations on a system of molecules interacting with a Lennard- 
Jones 6-12 potential and has calculated the so-called structure factor h(k) of the fluid. 
The structure factor is related to the Fourier transform of h(r) = g(r ) — 1, i.e., by 

K(k) = pj e~ ik ' r h(r) dr (14-1) 

Verlet calculated fi(k) since it is directly measured by X-ray scattering from the liquid. 


* L. Verlet, Phys. Rev., 165, p. 201, 1968. 
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(a) 



(b) 

Figure 14-1. The structure factor, fi(k), plotted as a function of the wave vector ka. The circles are the 
results from molecular dynamics calculations; the solid line is a hard-sphere model with 
an effective hard-sphere diameter; and the crosses represent neutron diffraction data. In 
(a) the reduced temperature is 1.326 and the reduced density is 0.5426. In (b), these values 
are 0.827 and 0.75, respectively. (From L. Verlet, Phys. Rev., 165, p. 201, 1968.) 

Figure 14-1 shows two graphs taken from Verlet’s paper. The dots are the molecular 
dynamics results, and the solid lines in both Fig. 14-1 (a) and (b) are the results for 
a hard-sphere fluid with an effective hard-sphere diameter that is a function of the 
temperature and the density. The important point is that at high densities the structure 
factor of the real (Lennard- Jones 6-12) fluid can be well represented by a hard-sphere 
structure factor. 

This physical picture suggests that we attempt to treat a fluid as a system of mole- 
cules governed by a repulsive potential with an attractive potential that is treated as 
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a small perturbation. The unperturbed system, i.e., the system of repulsive molecules, 
is usually taken to be a system of hard spheres since this system is fairly well under- 
stood from Chapter 13. In Section 14-1 we shall introduce the basic statistical me- 
chanical perturbation theory due to Zwanzig. Then in Section 14-2 we shall make a 
simple application of this perturbation theory formalism to derive the van der Waals 
equation. Remember that van der Waals in 1873 treated a liquid as a system of hard- 
sphere molecules with an attractive potential that gave a uniform background energy. 
Thus he was one of the earliest to use the perturbation theory ideas presented here. 
Van der Waals did not have the benefit of modern developments in the statistical 
mechanics of hard-sphere fluids and statistical mechanical theory of fluids in general, 
and we shall see how it is possible to improve upon the van der Waals equation in a 
very obvious and organized way. There were a number of developments along these 
lines in the late 1960s, and Section 14-3 will discuss some of these. We shall see in 
that section that the perturbation approach to liquids can be brought to the point of 
providing a simple, convenient theory of liquids that is numerically reliable well past 
the critical point and into the temperatures and densities of dense liquids. Recall that 
the integral equations of Chapter 13 gave poor numerical agreement near and below 
the critical region. 


14-1 STATISTICAL MECHANICAL PERTURBATION THEORY 


In this section we describe the statistical mechanical perturbation theory developed 
by Zwanzig in 1954.* The results are obtained by performing a perturbation expansion 
on the canonical partition function of the system and on the configuration integral 
in particular. 

Let the total potential energy be separated into two parts, 

U N =U^ + U™ 04 - 2 ) 

where U N i0) is the potential energy of an unperturbed (reference) system and U N {1) 
is the perturbation. The reference system is usually taken to be a hard-sphere system, 
but this is, of course, not necessary. The perturbation potential is usually defined by 
the difference between the potential of the real system and the potential of the reference 
system, i.e., by U N W = U N — U N i0) . 

The configurational integral for this potential is 

Z N = J- • • J e-Ko-w+Wi dr 1 • • • dr N 

We now multiply and divide by 


to get 




*1 ‘ ‘ ’ rfr N 


r l e -K v » ( °'+ v " il 'Mr t ---dT N 


Note that the second factor here can be considered to be the average of exp(— PU N W ) 
over the unperturbed or reference system. Thus we can write 


Zn = Z w (0) <exp(— ^l/ Ar (1) )> 0 


( 14 - 3 ) 


* R. W. Zwanzig, J. Chem. Phys ., 22, p. 1420, 1954. 
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where Z N i0) is the configurational integral of the unperturbed system and < > 0 indicates 
a canonical average in the unperturbed system. We assume that we know Z N i0 \ and 
we shall consider U N {1) to be small enough to expand the exponential in U N W m This 
gives an expansion in powers of /? or 1/T, according to 

<exp = 1 - P<V™ > 0 + ^<(l/ (1)2 > o + • • • (14—4) 

We wish, however, to express the free energy and related thermodynamic functions 
as a power series in /?. Since A = —AT In Q and Q = Z n /N\A 3n , we can write 

-PA = ln (j^X5*) + In<«p(-j&lV 1, > 0 04-5) 

= -PA 0 - /L4< J > 

where clearly A 0 is the free energy of the reference system and A {1) is the perturbation 
free energy: 

A<» = -kT\n<exp(-pU N W)> 0 (14-6) 

It is this quantity that we wish to express as a power series in /?. We write 

A w = £ ^ (-W 1 (14-7) 

where the co n are to be determined in terms of the <(C/ W (1) )"> 0 of Eq. (14-4) by writing 

exp (-/M (1) ) = exp^£ ^ (-/?)") 

We now expand the right-hand side in powers of P and compare like coefficients of 
/5" to 

exp(-/L4 (1) ) = <exp(— /?l/ w (1) )> 0 

= 1 ^<(^ (1) )‘>o 

fc = 0 

Straightforward algebra gives the first few <a n : 

®i = <^ (1) >o 

e>2 = <(t/ w (1) ) 2 > o - <C/ W (1) >0 2 0 4-8) 

®s = <(^ (1) ) 3 >o - 3<(t/ N (1) ) 2 >0 <t/ w (1) >0 + 2<l/ w (1) >0 3 

It is possible to derive a general formula for this, but it is not necessary here.* 

Thus we have shown that at high temperatures, the free energy is 

(O, , 

A = A o + <o 1 -^ + 0(p 2 ) (14-9) 

The first term in the expansion, co 1 = <(7 JV (1) ) 0 , simplifies considerably when t/ w (1) 
can be written as a sum of pair potentials, 

£/» (1> - I«°W 


R. W. Zwanzig, /. Chem. Phys., 22, p. 1420, 1954. 
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since in this case 




N(N - l)/2 


~ (0) f-f. ’('i 2) dr 1 dr N 

= I // W<1, ( , ' 12 ^ o(2) (' 12 ) Jr > * 2 = J l ' (1) ( r > 2 ^ 0<2) (' 12 ) *12 ( 14_1 °) 


The zero subscript in the last two integrals indicates that the distribution function is 
that of the reference system. 

On the other hand, terms like <(U W (1) ) 2 > 0 are awkward since they introduce higher- 
order density functions, namely, P C3) and P C4) , which are due to terms like w(/* 0 )w(r Jfc ) 
and w(r i jMr*|) which arise when U N il) is squared. We shall not require this second-order 
term in most of what we do here, and so we do not write it out. 

Equations (14-3), (14-4), and (14-8) are the basic results of this section. In order 
to apply them, we must choose some reference fluid and hence U N {0) and U N 0) . This 
is where the various applications, which we shall discuss in Section 14-3, differ from 
one another. Before going on to discuss these, however, we shall derive the van der 
Waals equation from a simple application of the equations of this section. 


14-2 THE VAN DER WAALS EQUATION 


We start with Eq. (14-3), 

Zw = Z w ( °)<exp(— /?l/ w (1) )>o 


and take the reference system to be a hard-sphere fluid. We furthermore assume that 
U N is pair-wise additive and to be of the form u(r) = w HS (r) + w (1, (r)» where u {1 \r) is 
some arbitrary attractive part (and so is negative). We then assume that PU N {1) is 
small enough that we can write 

<exp(— /?l/ w (1) )> 0 « 1 — /? <C/jv (1) >o 

~exp< — /J(U w (1) >o) (14-11) 

Equation (14-1 1) follows since we are assuming that terms in fi 2 and higher can be 
neglected. From Eq. (14-10), 

<t/* (1) >o = ~ f u w (r)g H ^r)4nr 2 dr 
* J o 


Certainly # HS (/*) was not available to van der Waals, and he effectively approximated 
dHs(r) by 


#Hs( r ) 


-I? 


r <a 
r>a 


(14-12) 


This form is correct for hard spheres only in the limit p 0. Using this # HS (r), we have 


<U N (1) >o = 27ip 2 F f u {1) (r)r 2 dr 

J tr 

= - aNp (14-13) 


where 


a = 


—2n 



w (1) (r)r 2 dr 


( 14 - 14 ) 
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The minus sign in the definition of a has been included to make a a positive number. 
Remember that w (1) (r) is negative. The specific form of w (1) (r) is not important here. 
So far now, Eq. (14-3) has been reduced to 

Z N = Z N ^e papN ( 14 - 15 ) 

and so we can write 


p_ = I d In Z N (0> \ _ afS_ 

kT \ dV Ijyj kT 
p i0) ap 2 

= If 


( 14 - 16 ) 


where /> (0) is the pressure of the unperturbed system. 

The final approximation of the van der Waals theory is to assume that the hard- 
sphere configuration integral is of the form V^ 9 where the effective volume is deter- 
mined by assuming that the volume available to a molecule in the fluid has a volume 
47ic7 3 /3 excluded to it by each other molecule of the system. However, we have to divide 
this quantity by 2 since this factor of Ana 3 /3 arises from a pair of molecules inter- 
acting, and only half the effect can be assigned to a given molecule. Therefore K ff = 
V — 2ttN<t 3 /3, and we can write 

Z w (0) = (V— Nbf b = ~ (14-17) 


Substituting Eq. (14-17) into Eq. (14-16), then, we get the famous van der Waals 
equation, 


P P ap 2 
kT 1 -bp kT 


( 14 - 18 ) 


Equation (14-18) is the van der Waals equation of state. Thus the constants a and b 9 
defined in Eqs. (14-14) and (14-17), are the usual van der Waals constants, but here 
they are given in terms of the intermolecular potential function. We can calculate 
a and b for the square-well potential, for example, and compare the results to the 
empirically determined van der Waals parameters. The agreement is not particularly 
satisfactory, indicating that the van der Waals equation is not a good quantitative 
equation of state. Nevertheless, it does indicate a critical point and condensation by 
way of a simple model. 

The perturbation theoretic derivation of the van der Waals equation suggests a 
number of obvious improvements. The most obvious ones are: 

(1) Use a better form for g(r). 

(2) Use a better expression for Z w (0) . 

(3) Use a more realistic unperturbed system. 

(4) Consider higher terms in /J. 

The results of Chapter 13 allow us easily to consider points (1) and (2). We could use 
the analytic Percus-Yevick equations of state [Eqs. (13-91) and (13-92)], the Ree 
and Hoover Pad6 approximant [Eq. (13-3)], or the Camahan-Starling equation of 
state (Problem 12-34). Furthermore, we could use the # HS (r) obtained from the 
Percus-Yevick or hypernetted-chain equations or even that calculated from molecular 
dynamics or Monte Carlo calculations. Points (3) and (4) are more difficult to in- 
corporate since for point (3) we still must rely upon the results from the radial dis- 
tribution theories of the previous chapter to give the unperturbed part and for (4) 
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because terms beyond the linear term in /? require three- and four-body distribution 
functions. The recent advances in the statistical mechanical perturbation theory of 
fluids have considered points (3) and (4) above, and we discuss these in the next 
section. 

14-3 SEVERAL PERTURBATION THEORIES OF LIQUIDS 

In the late 1960s and early 1970s there was much activity in applying the statistical 
mechanical perturbation theory to dense fluids. In this section we shall discuss three 
of these theories. Each of these theories uses a fairly different approach and illustrates 
a particular manner in which the simple van der Waals theory of the preceding 
section can be improved. 

THE BARKER-HENDERSON THEORY 

The first generally successful approach was that due to Barker and Henderson.* 
For simplicity, we shall first discuss their theory for a square-well fluid and then extend 
their approach to real (6-12) fluids. Barker and Henderson were the first to consider 
the higher-order terms in the l/T expansion in Eq. (14-9). The first-order term, i.e. 
co 1 in Eq. (14-9), presents no difficulty [cf. Eq. (14-10)]. The second-order term 
involves three- and four-body distribution functions: 

" 2 = T Jiff [p ° (4)(1 ’ 2> 3 * 4) “ p o ( 2 ) (i, 2 * )f> ° (2)(3 ’ 4)] 

x W (1 >(1, 2 )m (1 >(3, 4) dr 1 dr 2 dr 3 dt A 
+ N 3 JJJ P 0 (3) (l, 2, 3 )m (1) (1, 2)k«>(2, 3) dr t dt 2 dt 3 

+ y JJ P 0 (2) a . 2 )[k (1) (1, 2)] 2 dr , dx 2 (1 4-1 9) 

where 

P 0 (B) (1, 2,...,»») = =^ojf-f dr n+ 1 ••• dr N (1 4-20) 

We have left co 2 in terms of the P 0 (n) rather than the more commonly used g 0 (n) (1, 2, 
. . . , n) because the conversion of Eq. (14-19) from integrals over the P 0 (n) to integrals 
over g 0 {n) ’s is somewhat subtle. This subtlety is due to the fact that g(r) 1 + 0(1 /N) 
for large r,f and so the integrand in Eq. (14-19) differs from zero by terms of order 1 
in a region of molecular dimension and by terms of order l/N in the whole volume; 
so the integral of these terms contribute the same order of magnitude to the final 
result. We shall not discuss this fine point any further, but refer the reader to a dis- 
cussion by Henderson and Barker.} 

The important point about Eq. (14-19) is that it involves three- and four-body 
distribution functions, quantities that are not generally known. Barker and Henderson 
have approximated co 2 in two different ways. The first method goes as follows. 


* J. A. Barker and D. Henderson, J. Chem. Phys., 47, p. 2856, 1967; Accounts of Chem . Pcs., 4, p. 303, 1971 ; 

Adv. Treatise of Phys. Chem., 8A (in “Additional Reading*'); Arm. Rev. Phys. Chem., 23, p. 439, 1972. 
t See T. Hill, Statistical Mechanics, Appendix 7 (New York: McGraw-Hill, 1956), in “Additional Reading." 
% See D. Henderson and J. A. Barker, in Physical Chemistry, Vol. 8A, edited by H. Eyring, D. Henderson, 
and W. Jost (New York: Academic Press, 1971) in “Additional Reading.’* 
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Imagine the range of intermolecular distances divided into intervals (r 0 , /*), 

(/*!, r 2 ), . (r j9 r y+1 ), Let p(N 0 , N u N 2 , . . .) = p({Nj}) be the probability that 

Nj molecules lie in the interval (r j9 r J+1 ) in the unperturbed system. Also consider 
that the interval (r jy r j+1 ) is small enough that we can treat the perturbing potential 
w (1) (r) to have a constant value u/ l) in the interval (r 7 - , r J+1 ). The configuration integral, 
Eq. (14-3), can then be written in the form 

Z w = z w (0) ^exp ( - 0 E N J U j l) )\ 

= Zw (0) I KW)exp( - 0 1 N j M y (1> ) (1 4-21 ) 

The quantity p({Nj}) is as difficult to get as Z N , but only the first two moments are 
necessary in order to calculate through terms of 0(p 2 ). To see this, expand the 
exponential in Eq. (14-21) to get 

W'~ 

I "j) J (Nj) 

+ r I I «« (1 V 0 1 N i N jp« N j}) + ■■■ 

2 i J {Nj) 

= 1-01 «i (1> W + % II «i (1) <N, Nj} H 

To get the expansion for the Helmholtz free energy, we take logarithms to get 

^ = ^ + 01 <Nj >m/ 1) - i0 2 I I «N| N,> - <N { > <Ny»Mj (1) My (1 > + O(0 3 ) 

(14—22) 

The first moment is easy and is 
<JVy> = 2nNp f 1 r 2 g 0 (r ) 

= 2nNprj 2 g 0 (rj)(r j+1 - r ; ) (1 4-23) 

The evaluation of the second-moment term in Eq. (14-22) must be done approxi- 
mately. The Nj may be regarded as representing the numbers of molecules in spherical 
shells surrounding some central molecule. If these shells were large macroscopic 
volumes, the numbers of molecules in different shells would be uncorrelated, i.e., 

<iV i N y> - <N f > <N y > = 0 i*j (1 4-24) 

and the fluctuation of the number in a given shell would be given by [Eq. (3-53)]: 

<N/>-<N,y = <N ,>*r(!) r 

Thus the P 2 term in the expansion in Eq. (14-22) can be approximated by 


(14-25) 
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If we substitute Eq. (14-23) for <7Vj> into this and then pass to the continuum limit, 
we get 

- ^ J W l \r)fkT^ g 0 (r)4nr 2 dr + 0(fi 3 ) (1 4-26) 

This approximation for the ft 2 term is called the macroscopic compressibility approxi- 
mation . A more satisfactory approximation is obtained by arguing that one should use 
a “local” compressibility in Eq. (14-26), related to the pressure derivative of the 
density at a distance r from a given molecule; i.e., replace (dp/dp) 0 g 0 (r) in Eq. (14-26) 
by (8/dp)[pg(r )] 0 . This so-called local compressibility approximation gives 

_ J [ u ^(r)] 2 kT [^ Pgo) ] 4nr 2 dr + 0(p 3 ) (14-27) 

Note that the Helmholtz free energy A is expressed in terms of hard-sphere quantities 
and the perturbing potential w (1) (r). 

In applying this equation to a square-well system; Barker and Henderson took the 
hard-sphere core as their unperturbed potential and the well as the perturbing po- 
tential. The comparison of the numerical results of this equation with the computer 
calculations of Alder and Wainwright* and Rotenbergt is shown in Fig. 14-2. In 



Figure 14-2. Equation of state for the square-well potential. The points are the results of various mole- 
cular dynamics and Monte Carlo calculations, and the curves are Isotherms calculated 
from Eq. (14-27). The numbers labeling the curves give the values of e/kT. (From J. A. 
Barker and D. Henderson, /. Chem. Phys. 9 47, p. 2856, 1967.) 

* B. J. Alder and T. E. Wainwright, /. Chem. Phys. 33, p. 1439, 1960. 
f A. Rotenberg, J. Chem. Phys. 9 43, p. 1198, 1965. 
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obtaining these results, the Pade approximant of Hoover and Ree was used for the 
hard-sphere pressure p (0) . 

Smith et al* have also approximated co 2 by using the superposition approximation 
on the three- and four-body distribution functions. This, of course, allows A to be 
written in terms of the two-body hard-sphere radial distribution function. The equa- 
tions are too long to be given here, but the results are similar to the solid curves in 
Fig. 14-2. The agreement with the molecular dynamics results of Alder [Fig. 14-2(a)] 
is excellent. The agreement with the Monte Carlo results of Rotenberg is good except 
at the highest densities. Table 14-1 shows the first- and second-order terms for the 
Helmholtz free energy and the pressure. It can be seen that the perturbation series 
seems to converge rapidly. Table 14-2 compares the critical constants from the Barker- 
Henderson theory to the square-well molecular dynamics results of Alder. Again there 
is very good agreement. 


Table 14-1. The first- and second-order terms In the Barker-Henderson perturbation series for the 
Helmholtz free energy and the pressure for a square-well fluid 


per 3 

AjNkT 

AjNkT 

PiVo/kT 

Pi Vo/kT 

l.c* 

s.a.f 

l.c* 

s.a.f 

0.00 

0.0000 

0.000 

0.000 

0.0000 

0.000 

0.000 

0.10 

-0.5339 

-0.188 

-0.188 

-0.0403 

-0.010 

-0.010 

0.20 

-1.1405 

-0.282 

-0.282 

-0.1816 

-0.017 

-0.016 

0.30 

-1.8159 

-0.310 

-0.312 

-0.4502 

-0.001 

-0.005 

0.40 

-2.5511 

-0.295 

-0.306 

-0.8601 

+0.035 

+0.018 

0.50 

-3.3292 

-0.253 

-0.289 

-1.3993 

0.088 

0.026 

0.60 

-4.1234 

-0.198 

-0.281 

-2.0120 

0.147 

-0.001 

0.70 

-4.8950 

-0.140 

-0.228 

-2.5787 

0.196 

-0.034 

0.80 

-5.5933 

-0.087 

-0.291 

-2.9072 

0.216 

+0.036 

0.90 

-6.1606 

-0.045 

-0.261 

-2.7701 

0.194 

0.328 

1.00 

-6.5484 

-0.018 

— 

-2.0513 

0.137 



* Local compressibility approximation, 
t Superposition approximation. 

Source: J. A. Barker and D. Henderson, J. Chem. Phys . , 47, p. 2856, 1969; Symp. Thermophys. Properties, 
Paperc, 4th, p. 30, 1968. 


Table 14-2. Critical constants for the square-well potential (A = 1.5) 



Barker-Henderson 

molecular 


theory 

dynamics 

kTJe 

1.32 

1.28 

Vc/Vc 

-0.22 

0.235 

P c V 0 /NkT c 

0.076 

0.072 


Barker and Henderson have applied their perturbation theory to a Lennard-Jones 
potential as well.f They actually develop the Helmholtz free energy as a double per- 
turbation series. One variable, y, is a measure of the depth of the attractive well, and 
the other variable, a, is a measure of the inverse steepness of the repulsive part of the 

* W. R. Smith, D. Henderson, and J. A. Barker, J. Chem . Phys., 53, p. 508, 1970. 
f J. A. Barker and D. Henderson, J. Chem. Phys., 47, p. 4714, 1967. 
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potential, i.e., the smaller a, the steeper the repulsive part. They do this by defining a 
modified potential function v(a, y, d; r) corresponding to w(r) by 


m(</ + 



d + 


r-d 

< a 

a 


v(cc, y, d;r) = J 


0 


yu(r) 


. r — d . <7 — d 

a<d + <d + (14-28) 

OL Ct 

a <r 


The quantity d is a distance parameter which is as yet unspecified, and o is customarily 
taken to be that point at which the potential u(r) passes through zero. This appears 
to be complicated at first, but notice that i>(a, y, d; r) is independent of d and reduces 
to u{r) when a = y = 1. When a = y = 0, on the other hand, v becomes a hard-sphere 
potential of diameter d , which is still arbitrary (Problem 14-10). Thus by varying y 
and a, we can go from our original potential u{r) to a hard-sphere potential. 

The idea now is to express the Helmholtz free energy A in terms of this modified 
potential as a double power series about a = y = 0 according to 

. / 3 j 4 \ a 2 (d 2 A\ 

a - a ° + -y + %) ..... + 7 (*?)..„„ + • • • <h - 29 > 

Note that the coefficients of a, y, a 2 , etc., here are evaluated at a = y = 0 or, in other 
words, are hard-sphere quantities. The differentiations indicated in Eq. (14-29) involve 
lengthy algebra,* but the final result is quite simple: 

A = A 0 + alnNkTp d 2 0 o (d)|^f + J /(z) dzj -I- ylnNp J g 0 (r)u(r)r 2 dr 

- V 2jiN p(^} ^ [p J So (r)u 2 (r)r 2 drj + 0(a 2 ) + 0( ay) + • • • (1 4-30) 


In this result A 0 , g 0 , and ( dp/dp) 0 are the free energy, radial distribution function, 
and compressibility of a system of hard spheres of diameter d (as yet unspecified). 
The first-order terms in Eq. (14-30) are exact, but the y 2 term has been approximated 
by the local compressibility approximation. 

The value of d is still at our disposal. We choose for d the value 


rf=-f/(z)rfz (14-31) 

J o 

so that the linear term in a vanishes. This gives d as a well-defined temperature- 
dependent effective hard-sphere diameter. Barker and Henderson argue that with this 
choice of d, the terms in a 2 and ay in Eq. (14-30) are considerably smaller than the 
y 2 term. 

For a = y = 1, Eq. (14-30) is just the Helmholtz free energy for a system with 
potential w(r), and so we finally write 


A = An + 


2nNpj g 0 (r)u(r)r 2 dr - npP^J pj g 0 (r)u 2 (r)r 2 drj 


(14-32) 


where A 0 , g 0 (r), and ( dp/dp) 0 are the free energy, radial distribution function, and 
compressibility of a system of hard spheres of diameter d, given by Eq. (14-31). 


* See the reference to Barker and Henderson in “Additional Reading" for complete details. 
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Figure 14-3. The equation of state for the 6-12 potential according to Barker and Henderson. The 
curves are labeled by the value of T*. The points are a mixture of machine calculations and 
actual experimental data. (From J. A. Barker and D. Henderson, J. Chem. Phys., 47, 
p. 4714, 1967.) 



Figure 14-4. Internal energy for the 6-12 potential according to Barker and Henderson. The curves are 
labeled with the values of T*. The points are a mixture of machine calculations and actual 
experimental data. (From J. A. Barker and D. Henderson, /. Chem. Phys., 47, p. 4714, 
1967.) 
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0.6 0.8 1.0 1.2 1.4 


T* 

Figure 14-5. Densities of coexisting phases for the 6-12 potential according to Barker and Henderson. 

The points are a mixture of machine calculations and actual experimental data. (From J. A. 
Barker and D. Henderson, /. Chem. Phys. t 47, p. 4714, 1967.) 



r/o 

Figure 14-6. Radial distribution function of the 6-12 liquid near its triple point. The points give the 
results of molecular dynamics calculations, and the curves give the results of the Barker- 
Henderson perturbation theory. The quantities p* and T* are pu 3 and kT/e, respectively. 
(From J. A. Barker and D. Henderson, Ann. Rev. Phys. Chem. 3 23, p. 439, 1972.) 

Figures 14-3 through 14-5 show the results from Eq. (14-32) and other thermo- 
dynamic functions. These comparisons indicate that the Barker-Henderson perturba- 
tion theory gives excellent results at temperatures that are not too low. Figures 14-6 
and 14-7 compare the radial distribution function of the Barker-Henderson theory 
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Figure 14-7. Radial distribution function of the 6-12 fluid. The points give the results of molecular 
dynamics calculations, and the broken and solid curves give the result of the Percus-Yevick 
and Barker-Henderson theories, respectively. The quantities p* and T* are pu 3 and kTje 9 
respectively. (From J. A. Barker and D. Henderson, Ann. Rev. Phys. Chem 23, p. 439, 
1972.) 

to that from the Percus-Yevick equation and molecular dynamics. Note that the con- 
ditions of Fig. 14-6 are near the critical point and so this serves as a fairly severe test. 
Figure 14-7 compares the radial distribution function around the first maximum. 
Again one can see that the agreement is excellent. 

Table 14-3 compares values of A/NkT for a 6-12 fluid for a number of theories. 
The column headed by PY(is) indicates the Percus-Yevick results starting from the 
energy equation. We saw in Fig. 13-12 and the discussion surrounding it that this seems 
to be the most reliable treatment of the Percus-Yevick theory. The columns headed 
BHl(PY), BH1, and BH2 are the results of the Barker-Henderson theory calculated 
from first-order perturbation theory using the PY hard sphere g(r ), from first-order 
perturbation theory using the Monte Carlo values of the hard sphere g(r), and from 
second-order perturbation theory using the Monte Carlo g 0 (r)- The agreement of the 
BH2 results with those of the molecular dynamics is excellent. The other columns 
in Table 14-3 will be discussed shortly. Table 14-4 compares values of p/pkT in a 
similar way. Table 14-5 compares the critical constants obtained from this theory to 
the molecular dynamics calculations of Verlet and experimental data of argon. 
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Table 14-3. Valnes of A/NkT for the 6-12 potential 


kT/e 

9°* 

mole- 

cular 

dyna- 

mics* 

PY(£) 

BH1- 

(PY) 

BH1 

BH2 

CWA 

(PY) 

CWA 

2.74 

0.60 

-0.34 

-0.33 

-0.31 

-0.31 

-0.33 

-0.32 

-0.33 


0.70 

+0.01 

+0.01 

+0.04 

+0.02 

+0.01 

+0.02 

+0.01 


0.80 

0.43 

0.43 

0.45 

0.46 

0.42 

0.43 

0.41 


0.90 

0.93 

0.95 

0.97 

0.99 

0.95 

0.97 

0.92 


1.00 

1.59 

1.61 

1.65 

1.66 

1.62 

1.66 

1.56 

1.35 

0.60 

-1.77 

-1.75 

-1.67 

-1.65 

-1.75 

-1.73 

-1.74 


0.70 

-1.65 

-1.62 

-1.54 

-1.51 

-1.63 

-1.62 

-1.63 


0.80 

-1.41 

-1.37 

-1.30 

-1.26 

-1.41 

-1.39 

-1.41 


0.90 

-1.02 

-0.99 

-0.90 

-0.84 

-1.01 

-0.98 

-1.02 


0.95 

-0.72 

-0.72 

-0.62 

-0.55 

-0.72 

(-0.67) 

(-0.74) 

1.15 

0.60 

-2.29 

-2.28 

-2.16 

-2.15 

-2.30 

-2.25 

-2.26 


0.70 

-2.25 

-2.23 

-2.11 

-2.10 

-2.26 

-2.23 

-2.24 


0.80 

-2.06 

-2.06 

-1.95 

-1.92 

-2.10 

-2.08 

-2.09 


0.90 

-1.79 

-1.74 

-1.61 

-1.56 

-1.76 

-1.73 

-1.77 

0.75 

0.60 

-4.24 


-3.99 

-3.99 

-4.29 

-4.17 

-4.18 


0.70 

-4.53 

-4.50 

-4.26 

-4.26 

-4.28 

-4.51 

-4.51 


0.80 

-4.69 

-4.63 

-4.38 

-4.37 

-4.74 

-4.69 

-4.69 


0.90 


-4.55 

-4.29 

-4.26 

-4.67 

(-4.60) 

(-4.62) 

* Verlet et al. , Phys. Rev., 182, p. 307, 
Source: J. A. Barker and D. Henderson, 

1969; 159, p. 98, 1967. 
Arm. Rev. Phys. Chem., 23, 

p. 439, 1972. 




Table 14-4. Values of p/pkT for the 6-12 potential 

kT/e 

pa 3 

mole- 

cular 

dyna- 

mics* 

Monte 

Carlof 

PY(£) 

BH1- 

(PY) 

BH1 

BH2 

CWA 

(PY) 

CWA 

2.74 

0.65 

2.22 


2.23 

2.23 

2.24 

2.22 

2.21 

2.18 


0.75 

3.05 


3.11 

3.11 

3.14 

3.10 

3.11 

3.04 


0.85 

4.38 


4.42 

4.42 

4.48 

4.44 

4.50 

4.30 


0.95 

6.15 


6.31 

6.37 

6.41 

6.40 

6.57 

6.10 

1.35 

0.10 

0.72 


0.72 

0.77 

0.77 

0.74 

0.77 

0.77 


0.20 

0.50 


0.51 

0.54 

0.55 

0.52 

0.53 

0.53 


0.30 

0.35 


0.36 

0.35 

0.39 

0.36 

0.32 

0.31 


0.40 

0.27 


0.29 

0.25 

0.26 

0.26 

0.17 

0.17 


0.50 

0.30 


0.33 

0.29 

0.31 

0.27 

0.18 

0.18 


0.55 

0.41 


0.43 

0.40 

0.43 

0.35 

0.27 

0.27 


0.65 

0.80 


0.85 

0.85 

0.91 

0.74 

0.72 

0.71 


0.75 

1.73 


1.72 

1.77 

1.87 

1.64 

1.70 

1.64 


0.85 

3.37 


3.24 

3.36 

3.54 

3.36 

3.51 

3.28 


0.95 

6.32 


5.65 

5.96 

6.21 

6.32 

(6.58) 

(5.90) 

1.00 

0.65 

-0.25 


-0.22 

-0.25 

-0.21 

-0.36 

-0.51 

-0.50 


0.75 

+0.58 

0.48 

+0.57 

+0.62 

+0.71 

+0.53 

+0.43 

+0.40 


0.85 

2.27 

2.23 

2.14 

2.30 

2.48 

2.25 

2.41 

2.20 


0.90 

~3.50 


3.33 

3.57 

3.79 

3.53 

(3.96) 

(3.55) 

0.72 

0.85 

0.40 

0.25 

0.33 

0.50 

0.70 

0.25 

0.43 

0.26 


0.90 


~1.60 

1.59 

1.90 

2.15 

1.63 

(2.24) 

(1.83) 


* Verlet et al., Phys. Rev., 182, p. 307, 1969; 159, p. 98, 1967. 
f McDonald and Singer, J. Chem. Phys., 50, p. 2308, 1969. 

Source: J. A. Barker and D. Henderson, Arm. Rev. Phys. Chem., 23, p. 439, 1972. 






SEVERAL PERTURBATION THEORIES OF LIQUIDS 315 


Table 14-5. Critical constants for the 5-12 potential 



Barker- 

Henderson 

Verlet* 

Exp. 

(argon)t 

T* 

1.35 

1.32-1.36 

1.26 

Pc* 

0.30 

0.32-0.36 

0.316 

Pc* 

0.14 

0.13-0.17 

0.117 

Pc VjNkTc 

0.34 

0.30-0.36 

0.293 


* L. Verlet, Phys. Rev., 159, p. 98, 1967. 

f J. S. Rowlinson, Liquids and Liquid Mixtures (London: Butterworths, 1959). 


The Barker-Henderson theory was the first generally successful perturbation theory 
and showed that perturbation theory is probably the most appealing approach to the 
liquid state. Barker and Henderson have extended and summarized their approach 
in a number of later papers.* Two important extensions are the generation of tables 
that allow one to use Monte Carlo values of A 2 and the inclusion of three-body forces. 
These are discussed in the most recent reference given below. 


THE CHANDLER-WEEKS-ANDERSEN THEORY 

In 1971 a new perturbation theory of liquids was formulated by Chandler, Weeks, 
and Andersen.j The primary difference between their approach and the Barker- 
Henderson approach is the way in which the intermolecular potential is divided into 
an unperturbed and a perturbed part. Chandler, Weeks, and Andersen argue that it is 
physically significant to separate the potential into a part containing all the repulsive 
forces and a part containing all the attractive forces. As they point out, “As far as the 
motion of a molecule in a liquid is concerned, the molecules do not ‘ know ’ the sign 
of their mutual potential energy, but do ‘ know ’ the sign of the derivative of the po- 
tential, i.e., the sign of the force.” Notice that this separation is distinctly different 
from separating the potential into positive and negative parts as in the Barker- 
Henderson theory. 

Thus the CWA theory separates the intermolecular potential according to 
w(r) = « 0 (r) + i/ (1) (r) 


where for a Lennard-Jones 6-12 potential: 

u 0 (r) = u(r) + e r < 2 lf6 c 
= 0 r ^ 2 l/6 c 

w (1) (r) = — e r < 2 l/6 o 

= u(r) r > 2 1,6 c 


(14—33) 


In Eq. (14-33), e represents the depth of the potential well of w(r), and the term 2 1/6 <t 
is the same as r min for a 6-12 potential. This division is shown in Fig. 14-8. 

The separation utilized in the CWA theory has two advantages over other separa- 
tions. Equation (14-8) shows that the higher-order terms co 2 , co 3 , . . . are like central 
moments about some distribution, and so are smaller the more smoothly w (1) (r) varies. 
As Fig. 14-8 shows, the perturbing part of the potential chosen in the CWA theory 


* J. A. Barker and D. Henderson, J. Chem. Educ., 45, p. 2, 1968; W. R. Smith, D. Henderson, and J. A. 
Barker,/. Chem. Phys., 53, p. 508, 1970; ibid., 55, p. 4027, 1971; J. A. Barker and D. Henderson, 
Accts. Chem. Res. 4, p. 303, 1971; Ann. Rev. Phys. Chem., 23, p. 439, 1972. 
f D. Chandler and J. D. Weeks, Phys. Rev. Lett., 25, p. 149, 1970; J. D. Weeks, D. Chandler, and H. C. 
Andersen, J. Chem. Phys., 54, p. 5237, 1971. 
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Figure 14-8. The CWA separation of the Lennard- Jones potential, i/(r), into a part containing all the 
repulsive forces, w 0 (r), and a part containing all the attractive forces, i/ (1> (r). (From J. D. 
Weeks, D. Chandler, and H. C. Andersen, J. Chem. Phys ., 54, p. 5237, 1971.) 

does indeed vary slowly for all r. This is to be contrasted with Barker and Henderson’s 
separation of u(r), in which u 0 (r) is that part of u(r) greater than zero and w (1) (r) is 
that part less than zero. Their w (1) (r) varies quite rapidly with r, particularly between 
r = a and r = 2 1/6 <t. Secondly, the reference system chosen by Chandler, Weeks, and 
Andersen is more realisticthan a hard-sphere system. In fact, we shall see below that 
at high densities the radial distribution function of the Lennard-Jones systems can be 
very well approximated by the radial distribution function of a system with the po- 
tential u 0 (r) in Fig. 14-8, i.e., the reference system of CWA. 

Using Eq. (14-33) in Eqs. (14-9) and (14-10), then we have 

A = A 0 + ip 2 V J g 0 (r)u {1) (r) <&■ + ••• (1 4-34) 

where, of course, the zero subscripts indicate the reference system with potential u 0 (r ) 
given in Eq. (14-33). For the purpose of comparison with experimental data and other 
theories, it is convenient to calculate not A itself, but the excess free energy with respect 
to the ideal gas at the same volume, temperature, and density. If we denote this excess 
free energy by AA, we can write Eq. (14-34) in the form 

/NL4 = Mfo + jdrg 0 (r)u w (r) + • ■ ■ (14-35) 

Chandler, Weeks, and Andersen argue on physical grounds and then show numerically 
(see below) that for lo w as well as high temperatures this equation will be approximately 
valid, i.e., errors 10 percent when the density is low, but will become much more 
accurate as the density increases. 

The principal disadvantage of the CWA theory is that the properties of the reference 
state are not well known as they are for a hard-sphere system. In order to avoid having 
to perform expensive machine calculations to obtain this information, they present 
an approximate treatment of the reference system. The essence of their approximation 
goes back to Fig. 13-8, which shows the function y(r) = g(r)exp[Pu(r)\ versus r. It 
was stated there that the shape of y(r) is not too sensitive to the intermolecular po- 
tential. Chandler, Weeks, and Andersen approximate y(r) for the reference fluid, i.e.. 
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y 0 (r), by the similar function appropriate to a hard-sphere system of diameter d, 
y d (r). For g 0 (r), then, we have* 

9o( r ) - yd(r)exp[-/?Wo(r)] (14-36) 

They choose the value of d by requiring the thermodynamic properties of the reference 
system to equal those of a hard-sphere system through the equation [Eq. (13-54)] 

Thus if we equate the compressibility of the reference system to the compressibility 
of a hard-sphere system, we have 

J di(y d e- p “° - 1) = J dt(y a e ~ - 1) (1 4-37) 

where u d denotes as hard-core repulsion of diameter d. This equation gives a unique 
value of d as a function of both temperature and density. Yerlet and Weisf have 
presented a simple algorithm for d(P, p). Once a value of d is obtained for a particular 
p and /?, the free energy of the reference system is taken to be the free energy of a 
hard-sphere system with diameter d and density p. Figure 14-9 shows the behavior 



Figure 14-9. The effective hard-sphere diameter according to the CWA theory [Eq. (14-37)]. (From 
J. D. Weeks, D. Chandler, and H. C. Andersen, /. Chem. Phys., 54, p. 5237, 1971.) 

* Such an approximation had been used previously, e.g. t By M. Orentlicher and J. M. Prausnitz, Can. J. 
Chem., 45, p. 595, 1967. 

t L. Verlet and J. Weis, Phys. Rev., 5A, p. 939, 1972. 
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Figure 14-10. Plot of h(k) for p* = 0.844, T* = 0.723. The line represents the use of Eq. (14-36); the 
circles are the molecular dynamics results of Verlet. (From J. D. Weeks, D. Chandler, 
and H. C. Andersen, / Chem. Phys. 54„ p. 5237, 1971.) 


Table 14-6. Perturbation theory and Monte Carlo results for the excess free energy on the T* = 0.75, 
1.15, and 1.35 isotherms 


-£A A/N 


T* 

P* 

CWA 

Monte 

Carlo 

Barker- 

Hienderson 

0.75 

0.1 

0.55 

0.81 

0.57 


0.2“ 

1.15 

1.48 

1.16 


0.3° 

1.78 

2.11 

1.77 


0.4' 

2.42 

2.68 

2.38 


0.5“ 

3.06 

3.23 

2.96 


0.6“ 

3.65 

3.74 

3.48 


0.7“ 

4.14 

4.17 

3.90 


0.8* 

4.46 

4.47 

4.16 


0.84 

4.51 

4.54 

4.20 

1.15 

0.1 

0.29 

0.39 

0.30 


0.2 

0.60 

0.73 

0.61 


0.3 

0.92 

1.05 

0.92 


0.4“ 

1.23 

1.33 

1.20 


0.5“ 

1.51 

1.59 

1.46 


0.55“ 

1.63 

1.69 

1.56 


0.6“ 

1.74 

1.79 

1.65 


0.65 

1.82 

1.84 

1.72 


0.75 

1.88 

1.88 

1.76 


0.85 

1.77 

1.78 

1.63 

1.35 

0.1 

0.22 

0.30 

0.23 


0.2 

0.45 

0.56 

0.46 


0.3 

0.68 

0.80 

0.69 


0.4 

0.90 

1.00 

0.89 


0.5 

1.09 

1.16 

1.05 


0.6 

1.22 

1.26 

1.16 


0.7 

1.26 

1.29 

1.18 


0.8 

1.16 

1.19 

1.07 


* Corresponds to one-phase metastable states in the liquid-gas two-phase region. 

Source: J. D. Weeks, D. Chandler, and H. C. Andersen, /. Chem. Phys., 54, p. 5237, 1971 
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of d obtained numerically from Eq. (14-37) versus p and /?. It is interesting to note 
that the curves of d(p, /?) obtained here are very similar to the empirical curves for an 
effective hard-sphere diameter determined empirically by Verlet* in the discussion 
surrounding Fig. 14-1. Figure 14-10 shows fi(k ) [Eq. (14-1)] obtained from the ap- 


Table 14-7. Approximate results for the excess free energy on the p* — 0.88, 0.85, 0.75, and 0.65 
isochores 


p* 

T* 

CWA 

-P&A/N 

computer 

results 

Barker- 

Henderson 

0.88 

1.095 

1.94 

1.99 

1.80 


0.94 

2.84 

2.88 

2.65 


0.591 

6.80 

6.86 

6.30 

0.85 

2.889 

-0.93 

-0.86 

-0.94 


2.202 

-0.44 

-0.37 

-0.47 


1.214 

1.51 

1.55 

1.39 


1.128 

1.86 

1.91 

1.73 


0.880 

3.32 

3.35 

3.10 


0.782 

4.18 

4.21 

3.90 


0.786 

4.14 

4.17 

3.86 


0.760 

4.41 

4.44 

4.10 


0.719 

4.87 

4.90 

4.53 


0.658 

5.66 

5.71 

5.26 


0.591 

6.74 

6.81 

6.25 

0.75 

2.849 

-0.56 

-0.50 

-0.58 


1.304 

1.36 

1.41 

1.27 


1.069 

2.22 

2.26 

2.08 


1.071 

2.21 

2.25 

2.07 


0.881 

3.27 

3.30 

3.06 


0.827 

3.66 

3.69 

3.43 

0.65 

2.557 

-0.18 

-0.20 

-0.11 


1.585 

0.79 

0.85 

0.73 


1.036 

2.24 

2.30 

2.12 


0.900 

2.90 

2.96 

2.74 


Source: D. Chandler, and H. C. Andersen,/. Chem. Phys., 54, p. 5237, 1971. 


Table 14-8. Approximate and molecular dynamics results for the pressure and internal energy as 
obtained from Eqs. (14-38) and (14-39), respectively* 


P * 

T * 


PpIp 



-0A EIN 


CWA 

MD 

PY 

CWA 

MD 

PY 


1.128 

2.82 

2.78 

3.57 

5.08 

5.05 

4.98 


0.88 

1.82 

1.64 

3.17 

6.77 

6.75 

6.61 


0.786 

1.23 

0.99 

2.97 

7.70 

7.70 

7.51 


0.719 

0.69 

0.36 

2.82 

8.52 

8.51 

8.28 


* Columns 3 and 6 give the values obtained by applying the approximation g(r) = g 0 (r). Columns 4 and 7 
give the molecular dynamics results. The results obtained by using the solution of the Percus-Yevick 
equation for g(r) are given in columns 5 and 8. 

Source: J. D. Weeks, D. Chandler, and H. C. Andersen, /. Chem. Phys., 54, p. 5237, 1971. 


proximate radial distribution function in Eq. (14-36) for p* = pa 3 = 0.844 and T* — 
0.723. It can be seen from this just how much the repulsive part of the potential deter- 
mines the structure of a liquid at high densities. A similar result is found for the radial 
distribution function itself. 


L. Verlet, Phys. Rev., 165, p. 201, 1968. 
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Tables 14-6 through 14-8 show some thermodynamic results calculated from the 
CWA theory. Table 14-6 shows the excess free energy calculated according to Eq. 
(14-35). One sees that the excess free energy does converge at high densities (p* > 0.65) 
to the values predicted by computer calculations. Table 14-7 shows some more de- 
tailed results for fiAA/N. The CWA theory does seem to improve as the density 
increases. Table 14-8 gives some results for the pressure and the excess internal energy. 
Although these quantities could be calculated from A , they were actually calculated 
from the equations 

Pp = P~ iPp 2 J m'(r)g(r) dr (1 4-38) 

and 

^ = ip J u(r)g(r) dr (14-39) 

with g(r ) replaced by simply g 0 (r). It can be seen that the results are far superior to a 
solution of the Percus-Yevick equation for a 6-12 potential. The poorer results for 
the pressure found at the lowest temperatures in Table 14-8 are attributed to approxi- 
mating the reference system by an effective hard-sphere system rather than in any 
inadequacy of the CWA approach. 

Tables 14-3 and 14-4 include values of A/NkT and pjpkT calculated from the CWA 
theory. The column headed CWA(PY) indicates that the Percus-Yevick g 0 {r) was used, 
and the column headed by CWA indicates that Monte Carlo values of g 0 (r) were used. 
The numbers in parentheses were calculated using a hard-sphere system which was so 
dense that these results are of uncertain accuracy. As one would expect from a satis- 
factory theory, the results calculated from the more accurate g Q (r) are in better agree- 
ment with the molecular dynamics and Monte Carlo results. The agreement is seen 
to be excellent. 

There is at present some discussion* concerning the relative merits of the BH and 
CWA theories, but it can be said that the two theories are complementary rather than 
competitive. They are based on differing compromises, and each has its merits and 
drawbacks.! The Barker-Henderson theory uses a well-known reference state, but 
must include terms second order in /?. The Chandler- Weeks-Andersen theory, on the 
other hand, is rapidly enough convergent that it does not require second-order terms, 
but the reference state is not universal, as in the case with perturbation theories based 
on hard spheres. Regardless, both theories are very successful and show that the per- 
turbation theory approach to the liquid state is not only physically appealing but 
numerically very satisfactory. 
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PROBLEMS 

14-1. By using the van der Waals equation and the fact that (dp/dV) T and (3 2 p/dV 2 ) T 
equal zero at the critical point, show that 


8 a 

ZTbR’ 


r - 3b ■ *-2n- 


and 


Pc V c 3 R 

T c ~ 8 


Show also that the van der Waals equation can be written in reduced form as 


(p*+^)(» — 5)-! 


T* 


where p* 21b 2 pla , V* = Vi 2b, and T* - TWbTfta. 

14-2. Determine the virial coefficients associated with the van der Waals equation. How 
do they compare to the exact ones? 

14-3. The critical constants of nitrogen are p c = 33.6 atm, V c = 0.090 liters/mole, and 
T c = 126°K. Calculate the van der Waals constants and determine the molar volume of 
nitrogen at 300°K and 1 atm; 10 atm; 100 atm. Compare these values to those calculated 
from the ideal gas equation of state. 

14-4. Show that the van der Waals equation is a special case of the more general equation 


P = 


NRT 

~V~ 


Kp)-«0>) 


where a and (3 are functions of the density p, but independent of temperature. Rigby* calls 
this a generalized van der Waals equation. Show that two thermodynamic consequences of 
this equation are that ( ,dp/dT) v is a function only of the molar volume and that the entropy 
is determined solely by the £(/>) term. 

14-5. Using Eqs. (14-14) and (14-17), calculate the van der Waals constants a and b for 
nitrogen. For «(r), assume a Lennard-Jones 6-12 potential with e and a given in Table 12-3. 
Compare these calculated values to the experimentally determined values, a= 1.39 x 10 6 
cm 6 • atm/mole 2 and b — 39.1 cm 3 /mole. Such poor agreement is quite typical, simply indi- 
cating the inadequacy of the van der Waals equation. 

14-6. Show how terms in (1 IT) 2 involve g {3} and p (4) as well as g i2) when part of the inter- 
molecular potential is treated as a perturbation. Write out this term. 

14-7. Show that the expression for the Helmholtz free energy of a square-well fluid to 
second order in the Barker-Henderson perturbation theory is 


A -Ac 
NkT 


1 r ^° (r) dr - np {ii) i+V+V k [ p L r2&o(r) dr ] 


Quart. Rev., 24, p. 416, 1970. 
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where rj = 7 r/>cr 3 /6 and the Percus-Yevick compressibility equation of state has been used for 
(dpldp)o* 

14-8. Evaluate the square-well second and third virial coefficients from the Barker- 
Henderson theory and compare them to the exact expression as a function of T* = kTje. 

14-9. Using the Grundke-Henderson program for the hard-sphere radial distribution 
function given in Appendix D, evaluate the integrals in Problem 14-7 and compare your 
final results for A to those of Barker and Henderson in their original paper.t 

14-10. Show that Eq. (14-28) reduces to the hard-sphere case when a y = 1. 

14-11. In the second paper in the Barker-Henderson series, { the double Taylor expansion 
of the Helmholtz free energy, Eq. (14-30), is derived. Following the first few pages of their 
paper, derive Eq. (14-30). 

14-12. Show that the Barker-Henderson effective hard-sphere diameter is always less 
than or equal to o. 

14-13. Evaluate the Barker-Henderson effective hard-sphere diameter d(T) for a hard- 
sphere, square-well, and r~ 12 potential. 

14-14. Evaluate the Barker-Henderson effective hard-sphere diameter d(T) for the 
Lennard-Jones 6-12 potential and compare your result to Fig. 1 of Barker and Henderson's 
second paper. § 

14-15. Show that if three-body forces are included in the intermolecular potential, then 
the Barker-Henderson theory gives 


A = A 0 ~\r 2irNp f g 0 (r)u(r)r 2 dr 

^ a 

ar f « 1 

- r+4,+ 4 v *p [ p 1 Mr)u(r)r2 *| 

+ JJ 0c(123)h(123) dr 2 di 3 

where the Percus-Yevick compressibility equation has been used for (dp/dp)o . 

14-16. Using the Grundke-Henderson program for the radial distribution function 
given in Appendix D, evaluate Eq. (14-32) for the Lennard-Jones 6-12 potential and compare 
the results to those given in Table 14-3. 

14-17. The first step in the Chandler-Weeks-Andersen perturbation theory is the calcula- 
tion of the effective hard-sphere diameter d(p, T). This is defined through Eq. (14-37), but 
Verlet and Weislf have presented a simple algorithm from this defining equation. Define a 
quantity d R * by 

d K *(T) = f [1 - e~ >eoM ] dx (1 4-40) 

Jo 


where x^rja and v 0 (x) is the CWA reference potential. Verlet and Weis show that this is 
well represented by the empirical expression: 


0.3837 + 1.068)3* 
0.4293 + £* 


(14-41) 


♦ See J. A. Barker and D. Henderson, J. Chem. Phys.> 47, p. 2856, 1967. 
t /. Chem. Phys. t 47, p. 2856, 1967. 
t /- Chem. Phys 47, p. 4714, 1967. 

§/. Chem. Phys., 47, p. 4714, 1967. 

|| See J. A. Barker, D. Henderson, and W. R. Smith, Phys. Rev. Lett., 21, p. 134, 1968, for a discussion of 
the effect of the inclusion of three-body forces into perturbation theory. 

H Phys. Rev., 5A, p. 939, 1972. 
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where P* — l IT* = ejkT . For 0.7 <T*< 1.6, the error here is less than 2 x 10~ 4 , and for 
1.6 < T* < 4.5, the error may reach 8 x 10" 4 . The error also decreases rapidly with density. 
In addition, Verlet and Weis show that the density dependence of the CWA value of d* 9 
to first order, is given by 

d* = d R *(l+A8) (14-42) 


where 

s * 

210.21 + 404.6/?* 


(14-43) 


is a function of temperature only, and 


A = 


1 - 4.257? „ + 1.362i? w 2 - 0.8751-)?, 


with 


0 — V*) 2 




(14-44) 

(1 4-45) 


where rj = (7 r/6)pd 3 is the packing fraction. This procedure gives the CWA values of d* to 
within 1 percent. Before Eq. (14—45) can be used, however, d must be known. An iterative 
procedure will, however, suffice. As a first choice, set d* — d R *, then evaluate the quantities 
appearing in Eqs. (14—43), (14-44), and (14-45). Thus d* can be found from (14-42) and will 
yield the second choice for d*. Three iterations will yield ample convergence. Use this Verlet- 
Weis algorithm to reproduce the curves in Fig. 14-9. 

14-18. Using the Verlet-Weis algorithm represented in Problem 14-17 and the Grundke- 
Henderson program for the hard-sphere radial distribution function given in Appendix D, 
calculate the CWA reference system radial distribution function according to Eq. (14-36), 
namely, 

go(r) « ya(r)txpl-pu 0 (r)\ 

and compare this to the molecular dynamics results of Hansen and Weis,* and to the molec- 
ular dynamics calculations of Verlet.t 

14-19. The CWA expression for the excess free energy is given by Eq. (14-35): 

J8A A BbAo n C 

-jj- = — + iPp J dr g 0 (r)u(r) 


Using the results from Problems 14-17 and 14-18, reproduce Tables I and II from the paper 
of Weeks, Chandler, and Andersen, f 

14-20. Using Eqs. (14-38) and (14-39) with g(r) replaced by 
go(r) x y a (r)txp[-Pu 0 (r)] 

calculate the pressure and excess internal energy and reproduce Table III in the paper of 
Weeks, Chandler, and Andersen. § 

14-21. According to the approximation discussed in Problem 14-18 show that the struc- 
ture factor of the reference fluid may be written as 

h G {k) - h d (k) + />J dry a (r)[exp(—pu 0 ) — exp(— jSw d )]exp(-/k ■ r) 

Using this, reproduce Figs. 3 and 4 in the paper of Weeks, Chandler, and Andersen.^ 


* Mol. Phys., 23, p. 853, 1972. 
t Phys . Rev., 165, p. 201, 1968. 

%J. Chem. Phys., 54, p. 5237, 1971. See also J. Client. Phys., 55, p. 5422, 1971 
§ /. Chem. Phys., 54, p. 5237, 1971. See also J. Chem. Phys., 55, p. 5422, 1971. 
H/. Chem. Phys., 54, p. 5237, 1971. 
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14-22. Can the Barker-Henderson theory or the Chandler-Weeks-Andersen theory be 
applied to fused salts? Why or why not? Does the requirement of electroneutrality help? 

14-23. The Barker-Henderson theory has been applied to mixtures of real fluids.* Assume 
that the pair potential is of the form 

-(^) } 

along with the combining rules 

£12 — fi 2(^1 1^22) 1,2 
Cl 2 — K*II + 


Show that to first order 


A -Ac 
NkT 


47TpX 


iX 2 d 12 2 go 12 (d 


12 


)Wiz - 8 1; 


.] + 2ttPp 2 XxX u 

X.u 


r- 

J a»x 


Uxui^go^ry 2 dr 


where A 0 and go^Cr) are the Helmholtz free energy and radial distribution functions of a 
hard-sphere mixture with 


and 


dx u = i(dxx + d uu ) 
dxx = Sjlx 

K = f X "[l 

J a 


Grundke et al. show that very good agreement with various experimental data can be ob- 
tained with this equation. 

14-24. Neff and McQuarrief have applied perturbation theory to the calculation of the 
Henry’s law constant. Recall that this is defined by 



where p 2 is the partial pressure of the solute above the solution and x 2 is the mole fraction 
of solute in solution. 

From K h , the molar heat of solution A H s and the partial molar volume solute V 2 can be 
obtained from 


A H s / 1 \ p In K h 

~RT~ \7y [ dT- 1 \ p 



Consider a liquid solution of some solute (denoted by a subscript 2) in some solvent (sub- 
script 1) which is in equilibrium with gaseous solute at partial pressure p 2 . 

p 2 "* = p 2 ” ln 

If we assume the gas phase to be ideal, /x 2 gas = pz*(T) + kT In (p 2 /kT). The chemical potential 
of solute in the solution can be obtained from 


,, solo 

P'2 ~ 



V.T.Nx 


* Cf. e.g., E. W. Grundke, D. Henderson, J. A. Barker, and P. J. Leonard, Mol . Phys. y 25, p. 883, 1973. 
t /. Phys . Chem 77, p. 413, 1973. 
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From Problem 14-23, we have that 


P A = PAo — 4t rp ± N 2 d 12 *go(d 12 )[d 12 — S 12 ] -f 2rrP PtNj f dr r 2 Uu(r)g 0 tJ (r) 

uj j <t u 

where all the symbols are defined in Problem 14-23. 

Following Neff and McQuarrie, show that In K H can be written in the form 

RT U 2 hs \ //x 2 corr \ 

ln ^ = ,n ^ + (y + (ltr) 


where 


^2 


HS 


kT 


--in(i -r>+nMR'+ (j) [^]<*’ + *> 


np 1 d 


11 


*-p 


x(y) 


d+^+>' 2 ) 

d-jO 3 


f*2 


kT 


( 2 ^ g - ) + (n^ 1 ) _ 47rpi d ** 2 ( d ‘* - s ‘2)ffc ii (d 12 ) 


ai 2 = —2 7T f dr r 2 Ui 2 (r)g 0 12 (r) 

J <ri2 

r 00 r^o lf (r)l 

L 1 * r ’“" <r) |-wr L 


Neff and McQuarrie compare this equation and its derivatives to experimental data. 



CHAPTER 15 


SOLUTIONS OF STRONG 
ELECTROLYTES 


Up to this point we have not discussed solutions or mixtures to any extent. Several 
of the problems in Chapters 12 and 13 discuss the extension of certain one-component 
equations to mixtures, but we have not treated mixtures in any detailed way. It so 
happens that many of the concepts and techniques introduced in Chapters 12 and 13 
can be applied to mixtures of nonelectrolytes, but we shall not do this here since not 
many really new ideas are needed. For example, the perturbation theories of Chapter 
14 have been applied quite successfully to mixtures, and this subject is reviewed by 
Henderson and Leonard.* * * § Their article should be quite easy to read since it is based 
upon the material discussed in Chapters 13 and 14. Other specialized treatments of 
mixtures are Molecular Theory of Solutions t by Prigogine, “Liquids and Liquid 
Mixtures by Rowlinson, and Regular and Related Solutions by Hildebrand, 
Prausnitz, and Scott. § 

In this chapter we shall consider solutions of strong electrolytes. We do this in some 
detail not only because ionic solutions are of such great importance in physical chem- 
istry and biophysical systems, but, as we shall see shortly, the coulombic potential 
between charged particles requires special techniques. To see why this is so, consider 
a dilute aqueous solution of some strong electrolyte, say NaCl. For simplicity, let us 
represent the ions as hard spheres of diameters o+ and with the charge of the ion 
located at the center of the sphere. Furthermore, assume that the solvent, water in 
this case, can be represented by a continuous medium of uniform dielectric constant c. 
We also assume that the ions are each made of a material with the same dielectric 
constant as the solvent. This is a commonly used model for solutions of strong electro- 
lytes and is referred to as the primitive model of electrolyte solutions. This primitive 


* D. Henderson and P. J. Leonard, in Physical Chemistry , An Advanced Treatise , Vol. 8B (New York: 
Academic, 1971). 

1 1. Prigogine, Molecular Theory of Solutions (Amsterdam: North-Holland Publ., 1957). 

} J. S. Rowlinson, Liquids and Liquid Mixtures , 2nd ed. (New York: Plenum, 1969). 

§ J. H. Hildebrand, J. M. Prausnitz, and R. L. Scott, Regular and Related Solutions (New York: Van 
Nostrand, 1970). 
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model, then, suggests that we attempt to treat the solution as a gaseous mixture of 
positively and negatively charged hard spheres contained in some volume V and at a 
temperature T. The first deviation from ideality lies in the second virial coefficient, 
which, for a two-component system, is given by (Problem 12-17) 


j ^ — Pi + P2 + B\i(T)Pi2 + BulDPiPi + ^22(T)p 2 2 + • • • 

where in the thermodynamic limit (N and K-* 00 with NjV fixed) 

Bjj(T ) = —2n f a> [e-*w< r) - l]r 2 dr 
•'0 

and 


(15-1) 


(15-2) 


Bi 2 (T) = -An f - 1 ]r 2 dr 

->o 

For the primitive model of an ionic solution, u 12 (r), is 


w 12 (r) = co 


_ (<t + + <r_) 
r < 


g+g- r> (<*+ + g-) 
er 2 


(15-3) 


(15-4) 


where q+ and are the charges on the positive and negative ion, respectively. If we 
substitute this into Eq. (15-3), we find that the integral diverges. This is most readily 
seen by expanding the exponential and then integrating. Physically this divergence is 
due to the fact that the coulomb potential is so long ranged that the two-body inter- 
actions implied by a second virial coefficient do not exist. Thus it is not possible to 
decompose the many-body problem into a series of two-body, three-body problems, 
etc., and a straightforward virial expansion is not applicable. The long-range nature 
of the coulomb potential requires special consideration, and this is the topic of the 
present chapter. 

Before going on to discuss solutions of strong electrolytes, however, we shall briefly 
mention a virial-type expansion of solutions of nonelectrolytes developed by McMillan 
and Mayer.* The essential results of the McMillan-Mayer theory show that there is a 
rigorous one-to-one correspondence between the equations of imperfect gas theory 
and dilute solutions of nonelectrolytes. They showed that the pressure of the gas maps 
into the osmotic pressure of the solution and obeys a virial expansion of the form 

?^=Ci + I Bj* (Ati. T)c 2 " (1 5-5) 

where c 2 is the concentration of solute and jq is the chemical potential of the solvent. 
The virial coefficients in this case are formally identical to those of imperfect gas 
theory, but instead of the potential U N , we use the potential of mean force of N solute 
molecules in the pure solvent. The McMillan-Mayer theory can be extended to the 
development of distribution functions as well, and although in this chapter we shall 
represent the ionic solutions by an idealized model, we should keep in mind that its 
statistical mechanical basis lies ultimately in the McMillan-Mayer theory. 


* For a simplified version of the McMillan-Mayer theory, see T. L. Hill, J. Chem. Phys ., 30, p. 93, 1959. 
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In the remainder of this chapter we shall treat only the primitive model, and often 
the so-called restrictive primitive model, in which the ions all have the same diameter. 
This means that we shall not be able to discuss pressure and temperature effects since 
this would require that we treat the solvent not as a continuum but to recognize 
explicitly the molecular nature of the solvent- In particular, it is the short-range 
potential of mean force that would have to be evaluated, and in the case of water as 
a solvent, this would involve a detailed study of orientation and interaction of water 
molecules around two ions. Such a calculation is at present still very difficult and 
represents a substantially undeveloped area.* The gross oversimplification of replacing 
the complicated short-range interaction (potential of mean force) by a hard-sphere 
potential is not too crucial for fairly dilute solutions, however, since the ions on the 
average will be far apart. 

In Section 15-1 we discuss the famous Debye-Hiickel theory, which we shall see is 
the exact limiting theory of ionic solutions, i.e., exact in the limit of small concen- 
trations. 

In Section 15-2 we shall discuss the connection between the Debye-Hiickel theory 
and statistical mechanics. First, we shall show in this section that the Debye-Hiickel 
theory is an exact limiting law, and then discuss the application of the integral equation 
theories of Chapter 13 to ionic solutions. Lastly, we shall introduce briefly some recent 
theories of ionic solutions that have proved to be very successful. 


15-1 THE DEBYE-HUCKEL THEORY 


Consider a system of positive and negative point charges in a continuum medium 
of dielectric constant e. If we have a set of point charges with charges {qj + } and {q k ~} 
located at the points (r and (rj, then the potential at the arbitrary point r is 


«*) = ! 

j 


e l r — r yl 


+1 


k 



(15-6) 


If instead of point charges we have some continuous charge density p + (r) and p (r), 
then 



P V) dr' 
a|r-r'| 

p( r') dr' 
e|r- r '| 


+ 


r p (t')dt’ 
Je|r-r'| 


(15-7) 


where p(r) = p + (r) + p - (r) is simply the total charge density at r (more properly, 
p(r) dr is the total charge density in dr at r). 

Another fundamental equation of electrostatics that relates the potential to the 
charge density is Poisson’s equation, 


W 2 (j) = 


4np 

e 


(15-8) 


This equation can be derived in a number of ways, the most straightforward being to 
take the Laplacian of both sides of Eq. (15-7). (See Problems 15-2 and 15-3.) Equa- 
tions (15-7) and (15-8) are equivalent, and Eq. (15-7) can be considered to be the 


* Sec, for example. Chemical Physics of Ionic Solutions, ed. by B. E. Conway and R. G. Barradas (New York: 
Wiley, 1966). 
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general solution to Poisson’s equation. Either equation can be used in developing the 
Debye-Huckel theory, but it is customary to use the differential form; namely, 
Eq. (15-8). 

One other fundamental law of electrostatics is Gauss’ law, which states that 



where the integral is taken over some arbitrary closed surface S and where q is the 
total charge enclosed by the surface. The integrand E • dS is the magnitude of the 
component of the electric field perpendicular to the small surface area dS times the 
magnitude of dS itself. We shall use Gauss’ law only for the simple special case of 
spherical symmetry. Suppose that there is a charge q located at some point. If we draw 
a sphere of radius r around this point, the electric field due to this charge will be 
automatically perpendicular to S and also constant at all points on S. Thus Eq. (15-9) 
becomes 


E4nr 2 


4nq 

e 


which gives Coulomb’s law, namely, that 


E 



(1 5-1 0) 


With a little bit of vector analysis, Poisson’s equation can be derived from Gauss’ law; 
so the two equations are really two formulations of the same physical law (Problem 
15-3). Gauss’ law will be used to derive Eq. (15-37). 

With these preliminary electrostatic considerations out of the way, we go on to 
consider the electrostatic Helmholtz free energy of the primitive model of a system of 
N cations and N anions in a volume V and temperature T. By electrostatic Helmholtz 
free energy we mean A —A 0 , where A is the Helmholtz free energy of the charged 
system and A 0 is the free energy of the same system but with all the charges set equal 
to zero. If U N is the total potential energy of the “charged-up” system and U N ° is 
that of the uncharged system of rigid spheres, then 


-P(A-Ac)_ Z N _ J J e ' ' ’ d*2 N 

“z»°" J Se-^dT^-dr^ 

where 


( 15 - 11 ) 


U N 


^ y, Qi Qj 

i>l 


+ il' 

*» j 


(1 5-1 2) 


As usual, the prime on the summations indicates that the i = j term is not included in 
the summation, and q k represents the charge on the kth ion. The quantity w l7 (s) is the 
short-range interaction between ions i and j. We shall omit from consideration the 
self-energy of the ions since this self-energy is unaffected by changes in concentration, 
etc. In the primitive model, « (s) (/ , I -y) is taken to be that of a rigid sphere. 

The quantity 


W r ;) = I 


<U 


i*j £ \ r i ~ r j 


(1 5-1 3) 
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is the electrostatic potential acting upon the yth ion whose center is located at the 
point Tj . This potential is due to all the other ions in the solution. The total electro- 
static potential of the entire system, then, can be written in the form 

^N.elec = i Z <lj t/Fj) < 1 5 “ 1 4 > 

j 

We now define the canonical ensemble average of x/z/i j) by 


<f?> = 


“ VN dT l"' dT 2N 

i--ie~ pVN di 1 -dr 2 „ 


(15-15) 


Given this definition, then clearly Eq. (15-11) shows that (Problem 15-8) 



j = l,2,...,2N 


The total differential of A at constant V and T is 


(1 5-1 6) 


dA = Y dqj = £ <^> dq s ( 1 5 - 1 7 ) 

Thus we see that if we can calculate <^->, we can calculate the Helmholtz free energy 
of the system. We can vary each charge in the system simultaneously from 0 to its full 
value qj by writing dq 5 = q } dX for each j where 0 < X < 1. Then Eq. (15-17) for the 
electrostatic free energy can be written as 
-i 

A-A 0 = Y<lj dX (15-1 8) 

J J o 

where is the average electrostatic potential acting upon the jth ion when each 

of the ions of the system has the charge Xqj . The Debye-Htickel theory involves an 
elegant method for calculating and hence the electrostatic Helmholtz free 

energy, or essentially the canonical ensemble partition function. 

To proceed with the Debye-Huckel theory, first consider the potential at some point r 


m aW9) 

where the summation runs over all the ions in the system. We now canonically average 
^(r), keeping any one particle, say 1, fixed at r t to give 


l <*Kr, r i)> = 


j • • • f H r )e PVn dr 2 ■ - • dr 2N 
J ••• J e ~ fVv dt 2 •■ dt 2N 


(15-20) 


Remember that the left superscript here means that particle 1 is fixed at the origin. 
It is easy to show that 1 <^(r, r^) satisfies Poisson’s equation by taking the Laplacian 
of both sides of Eq. (15-20) with respect to the variable r: 


V 2 (‘<^(r, r t )» 


f — f V 2 ^(r)g ,Vir dr 2 • ■ - dt 2fl 

S f e PVk • • • dt 2N 

c 

J I e~ fVv dx 2 '■•di 2N 


= - — 1 <P( r > fi)> 
e 


(15-21) 
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We can relate *<^(r, r t )> to the quantity <^> that appears in the expression for the 
electrostatic Helmholtz free energy [Eq. (15-18)]. The quantity 1 <^r(r s r 1 )> is the 
electrostatic potential energy at the point r in a system in which particle 1 is fixed at 
r l9 but the other ions are canonically averaged over the volume V. The quantity 

*)> - (1 5-22) 

is the average electrostatic potential at r due to all ions except 1 fixed at r t . Finally, 
we can write that 

<^i> = 1 <^(ri)> (15-23) 

since 1 <^i(r 1 )> is the average electrostatic potential acting upon particle 1 fixed at r u 
and this is just <^> if the system is an isotropic fluid. Thus the 1 <^(r, rj) governed 
by Eq. (15-21) directly gives the terms needed to calculate the electrostatic Helmholtz 
free energy according to Eq. (15-18). 

The total charge density that appears on the right-hand side of Eq. (15-21) can be 
written in terms of radial distribution functions according to 
2 

‘<P(r, r,)> = X c s q,g u (r, r t ) (1 5-24) 

S= 1 

where c s is the bulk number concentration of the j-type ions and g u ( r, r t ) is the radial 
distribution function of j-type ions about the central ion located at r t . Except for a 
negligibly small region near the walls of the container we can set r t = 0 or, alternatively, 
simply recognize that 1 <^(r, i^)) and # ls (r, r x ) depend only upon |r — t t \ . We can 
furthermore express the radial distribution functions in Eq. (15-24) in terms of 
potentials of mean force to give 

47 c 

VWW)) £ C s q s e~» w ^ (1 5-25) 

C s 

This equation is exact up to this point and forms the starting point for the Debye- 
Hiickel theory. 

The first approximation of Debye and Hiickel and is to set the potential of mean 
force equal to 

= q s \U r )> = <ls <l> i(r) r > a (1 5-26) 

where for simplicity of notation we have written <^,(r) for and are letting the 

radii of all the ions be a\2 . This last restriction is easily relaxed, but the notation is 
simpler if we use it. In the primitive model, the right-hand side of Eq. (15-25) vanishes 
for r < a. If Eq. (15-26) is substituted into Eq. (15-25), we get the Poisson-Boltzmann 
equation, 

4ft 

V 2 4>i(r) = X c s q s e- fqAm r>a (1 5-27) 

8 s 

which is a nonlinear differential equation for 4>i (*), which in turn is used to calculate 
the electrostatic Helmholtz free energy of the system. Being nonlinear, this equation 
is generally difficult to solve, and the second approximation of the Debye-Huckel 
theory is to linearize the right-hand side by expanding the exponentials. Thus we write 

X c s q s e~ fq ’* lit) » X - P X c sQs 2 4>i(r) 


(15-28) 
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Remember that c s is the number concentration of species s, i.e., that c s = NJV, and 
so the first summation here vanishes because of electroneutrality; the total positive 
charge must equal the total negative charge in the system. 

This linearization procedure gives the linear Poisson-Boltzmann equation: 

V 2 <£ i(r) = fc 2 ^(r) r > a (1 5-29) 

where 

k 2 = — (15-30) 

S s 

Notice that k 2 is closely related to the ionic strength, 

J = ±£& 2 c s 

5 

This is the basic equation of the Debye-Hiickel theory. We have introduced two 
approximations in going from the exact Eq. (15-25) to the linear Poisson-Boltzmann 
equation. We shall see that both of these approximations become justified (in fact 
are not approximations) in the limit of small concentrations. Thus although it is not 
obvious from the above derivation, Eq. (15-29) is exact as k -+0. 

Equation (15-29) is appropriate only for the region in which r> a. For r < a, the 
centers of the surrounding ions are excluded (see Fig. 15-1) and hence the appropriate 
equation is Laplace’s equation, namely, 

V 2 <£i(r)=0 0 <r<a (15-31) 

We now wish to solve Eqs. (15-29) and (15-31) for ^(r). The boundary conditions 
that must satisfy are that ^(r) must vanish at infinity, that <f>i(r) be continuous 
across the surface specified by r = a 9 and that e times the normal derivative of 
be continuous across the surface r = a. This last condition comes from the fact that 
the normal component of the displacement vector D (Problem 15-6) is continuous 
across any charge free surface. These three conditions will specify ^(r) uniquely. 



Figure 15-1. An illustration of the boundaries involved in solving the linear Poisson-Boltzmann equation 
for the restricted primitive model. 


THE DEBYE-HUCKEL THEORY 333 


Since particle 1 is fixed at the origin and <j>i(r) is spherically symmetric, we naturally 
use spherical coordinates, which gives 




r> a 


(1 5—32) 


The general solution of this equation is (Problem 15-9) 


. , x A,e~ Kr B t e Kt 


r> a 


(1 5-33) 


where A t and B t are constants of integration. Since fa must vanish as r -► oo, we set 
Bi = 0. So we have 




r>a 


(1 5-34) 


We can determine A l by applying the boundary conditions at the surface r = a. 
For the region between r = 0 and r = a (cf. Fig. 15-1), we have 


which immediately gives 


< r <a 


(1 5-35) 


fa(r) = — + B 2 0 <r <a 


(1 5-36) 


We can use Gauss’ law to evaluate the constant A 2 . Applied to this problem. 
Gauss’ law says that for 0 < r < a y 


(-£)- 


4 7tq t 0 < r < a 


from which we find that A 2 = qje. Equation (15-36) then becomes 


<£i( r ) = “ + ^2 0 <r < a 

er 


(1 5-37) 


The conditions that fa (r) and edfa/dr [c times the normal derivative of fa(r)] must be 
continuous at r = a can be used to determine the two remaining constants A x and B 2 . 
These two conditions give 


+ B 2 

a ea 


cr a tar 


from which we find that 


1 a(l + Ka) 


(1 5-38) 


<h* 

a(l + Ka) 


(15-39) 
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Putting all this together, then, we have 




K 

fi(l + Kd) 


0 <r<a 


g 1 e~ K(r ~ fl) 
er(l + Ka) 


r> a 


(1 5-40) 


Note that Eq. (15-40) is of the form e~ Kr /r, i.e., what is called a screened coulombic 
potential. 

According to Eqs. (15-22) and (15-23) 


<^i> = l <»M r )> = 4>i(r) ~ — 

er 

which, when Eq. (15-40) is used, gives 


(15-41) 


<^i> = - 


qi k 


c(l + Ka) 
or more generally 

qjK 


<$j) = 


c(l 4- Ka) 


(1 5-42) 


(1 5-43) 


This, then, is the average electrostatic potential acting on the jth ion fixed at the 
origin. This potential is due to all the other ions in the solution. 

Before going on to derive the thermodynamic expressions associated with the 
Debye-Hiickel theory, we shall first discuss some general requirements that <^> must 
satisfy, and then we shall discuss the linearization approximation of the Debye- 
Hiickel theory. 

The quantity <^> must satisfy two conditions. According to Eq. (15-26), Debye 
and Hiickel assume that 

WjAr) = (1 5-44) 

where we have written j instead of 1. From the formal definition of the potential of 
mean force in a fluid, we see that 

Wjs(r) = w sj ( r ) (1 5-45) 

which requires that k <^(r)> be such that 

qf<Mry> = qJ<Mry> o 5-46) 

Note that *<^(r)> given by Eq. (15-40) does, in fact, satisfy this self-consistency con- 
dition. Another condition on *<^(r)> follows directly from Eq. (15-17). Since A is 
an exact differential, we have the requirement that 


dqj dq i 


(1 5-47) 


for all i and j. It is easy to show that <^> given by Eq. (15-43) satisfies this condition 
(Problem 15-18). It is interesting to note that solutions to the nonlinear Poisson- 
Boltzmann equation do not satisfy these self-consistency conditions, and so the lineari- 
zation employed by Debye and Hiickel is a necessary step in deriving an exact limiting 
theory. 



THE DEBYE-HUCKEL THEORY 335 


It is instructive to examine <£//*) further. For example, the total charge surrounding 
the central ion j is given by 

r 00 

J J (p(r)y4nr 2 dr (15-48) 

which, according to Poisson’s equation, can be written as 


- h. r vw(r)>}47tr2 dr 


(1 5-49) 


The Debye-Hiickel theory sets V 2 { J <^(r)>} equal to /c 2j <^(r)> = K 2 <j)j(r), where <£//*) 
is given by Eq. (15-40). Substituting this into Eq. (15-49) gives 


■if 

471 J 


OC 


/c 2 ^g~ K(r ~ fl) 
er( 1 + Ka) 


4nr 2 dr = 


-9j 


(1 5-50) 


This shows that the total charge surrounding a central j-ion is equal and opposite to 
the charge qj on the j ion. 

Equation (15-50) shows that 

p(r) dr — — ■ 4ft/- 2 dr = - ? jK e~ K(f ~ a) r dr r>a (15-51) 

47t 1 + Ka 


is the fraction of charge between the spherical shells of radius r and r + dr. The 
quantity p(r) is plotted in Fig. 15-2, which shows that there is a tendency for ions 
having a charge opposite to that of the central ion to distribute themselves around the 
central ion. Thus it is said that Eq. (15-51) and Fig. 15-2 describe the ionic atmosphere 
around a central ion. 



KT 

Figure 15-2. The fraction of charge between the spherical shells of radius r and r + dr. This illustrates 
the idea of an ionic atmosphere. 
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The maximum in p(r) is given when r = 1/k, and so values of r around this value 
are the most important in determining the thermodynamic properties of the solution. 
Equation (15-30) shows that k oc c 1/2 , where c is the concentration of electrolyte, and 
so the important values of r become larger as c becomes smaller. For an aqueous 
solution of a 1-1 electrolyte at 25°C, k ~ 1 =3.04 c" 1/2 A (angstroms) when c is the 
concentration of salt expressed in moles per liter, and so k ~ 1 is about 30 A when 
c = 0.01 mole/liter. At these concentrations then, the short-range interaction between 
ions does not play an important role, and its precise nature is unimportant. Problem 
15-20 shows that at concentrations around 0.01 mole/liter or less, the linearization 
of the Poisson-Boltzmann equation is valid as well. 

The above paragraph shows that the Debye-Hlickel theory is valid only for fairly 
small concentrations and becomes better as the concentration becomes smaller. We 
cannot show at this point that the approximation w i7 (r) = q { <j>j(r) is valid for small 
Ka or concentration, but we shall show in Section 15-2 that in the limit of small Ka that 

(1 5—52) 

£r 


This is just what is obtained from Eq. (15—40) for small Ka. Thus we say that the 
Debye-Hiickel theory is an exact limiting law, and we expect that the theory correctly 
describes the thermodynamic properties of all solutions of strong electrolytes as the 
concentrations approach zero. It is a reliable theory for 1-1 electrolytes for concen- 
trations around 0.005 mole/liter or less or, in other words, for quite dilute solutions. 
Nevertheless, the Debye-Hlickel theory represented a great advance in the theory in 
ionic solutions since before it there was some amount of disagreement and confusion 
concerning the nature of such solutions. 

We shall now derive the thermodynamic expressions of the Debye-Hiickel theory. 
The electrostatic Helmholtz free energy is given by Eq. (15-18): 


A - A 0 = £ q s f <^/A)> dX 

i J 0 


( 1 5 - 53 ) 


Remembering that k depends upon the magnitude of the charge on the ions, we write 


A — A 0 


qf r 1 Xk(X) dX 

j e J 0 1 + k(X)o 


(1 5-54) 


Using Eq. (15-30) for k, this integral is easy and gives (Problem 15-12) 


PA' 1 _ IKA-A 0 ) 
V V 




(1 5-55) 


where 

3 [ K?a 2 \ 

T ( Kfl ) = ^3 j 1 ^ 1 + Ka ) - Ka + ~2~] 

For small k, t (tea) becomes 

t(/ca) « 1 — J/ca + i( Ka ) 2 + • " 

The electrostatic chemical potential of the jth ionic species is given by 

= rr_L/££!Y| 

J \dNj)v,T Pl 3Nj\ V )\t,v 


(15-56) 


(1 5-57) 


(1 5-58) 
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Using Eq. (15-55) for PA*lV, we find that 
ei k Qj 2 

nf = - tttt 1 — ^ (1 5-59) 

1 2<1 + m) 

This equation can be derived in quite a different manner by using a simple generaliza- 
tion of Eq. (13-34). (See Problem 15-14.) We can calculate the electrostatic Gibbs 
free energy from this by using 

G e, = I w 

j 

This gives 

pG* K 3 

= (1 5-60) 

V 87r(l + Ka) K ; 

In addition, we can determine the electrostatic osmotic pressure from the equation 

v v 


(1 5-60) 


Using Eqs. (15-55) and (15-60), we get 


pp tl = 


24n 1 1 + Ka 


2t(kb) 


(15-61) 


Fowler and Guggenheim write this as (Problem 15-15) 


Pp" = 


c(jca) 


(1 5-62) 


where 


^ + “ - 2 + “>) 
« 1 — f Ka + y (tea) 2 + ■ ■ • 


(1 5-63) 


The above expressions for the various thermodynamic functions, although fre- 
quently seen in the literature, are somewhat misleading since, as we have seen earlier, 
the entire theory should be valid only in the limit Ka -► 0. We should really use only 
the limiting forms of these equations. For example, Eq. (15-55) for PA cl /V is 

PA* 1 K 3 

— = -T2^ T(Ka) 

The limit of this as Ka -* 0 is 
pA* k 3 

~r = - IS (15 - 64) 

This goes for the other thermodynamic functions as well: 
pG * k 3 

— (15-65) 


Pp* = 


(1 5-66) 


Ps l = 


(1 5-67) 
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Since these electrostatic quantities are taken as ko -* 0, we can write 




24 n 


(15-68) 


and 


Hj = kT In A j 3 + kT In Cj 


2e 


(1 5-69) 


We can use these equations to present two other thermodynamic functions that are 
commonly used in the study of ionic solutions. We define the osmotic coefficient <j> by 


Cl +c 2 

which, in the Debye-Hiickel theory, is 


24n(c i + c 2 ) 


(1 5-70) 


(15-71) 


For an ideal solution <f> is unity. 

The other thermodynamic function we wish to discuss is the activity coefficient y jy 
defined through the equation 


Hj = kT In A j 3 + kT In Cj + kT In yj 


(1 5-72) 


By comparing this equation with Eq. (15-69), we get 


In yj = 


Wj_ 

2ekT 


(15-73) 


If instead of Eq. (15-67) for pf 1 we use Eq. (15-59), we find that 


In yj = - 


Kq/ 


2kTe{\ + kq) 


(1 5-74) 


Although this equation is not, strictly speaking, valid, it is often used in the treatment 
of experimental data and will, in fact, be used in the next section. This and Eq. (15-40) 
are often referred to as the extended form of the Debye-Hiickel theory. 

Of course, individual activity coefficients cannot be measured experimentally. One 
measures the mean activity coefficient,* which for a binary salt is defined by 

y± =(y + v *y- v -) i,v d 5-75) 


where v = v + + v_ and v + and v„ are given by the chemical equation describing the 
dissociation of the electrolyte: 

V +C + + v -^ 

From Eq. (15-75) we write 


In y± 


v + In y+ + v_ In y _ 
v+ + v_ 


* W. J. Moore, Physical Chemistry , 4th ed. (Englewood Cliffs, N.J.: Prentice-Hall, 1971). 
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which, with the use of Eq. (15-73) for In y j, becomes 
, /V + 0 + 2 + v_q_ 2 \ K 

lnr ‘~~l )m- 


(15-76) 


Because of electroneutrality, v+q+ + v_q_ = 0, this can be rewritten as (Problem 
15-17) 


lny± = -\q + q-\ ^ (15-77) 

A plot of In y ± versus the square root of the ionic strength should yield a straight line 
for sufficiently low concentrations, and the slope of the line should depend upon the 
type of electrolyte through the factor \q+q-\. Figure 15-3 shows such a plot for 
several different types of electrolytes. At higher concentrations we expect to find 
deviations from such limiting behavior, and this is shown in Fig. 15-4 for a number 
of 1-1 electrolytes. Note, however, that all of these salts yield the same limiting be- 
havior. At the higher concentrations, the size and the nature of the short-range 
interactions of the various salts become important, and each salt shows individual 
behavior. 

In this section, then, we have discussed in some detail the Debye-Hiickel theory, 
which is the exact limiting law for electrolytic solutions. Deviations are expected and 
observed for concentrations greater than 0.01 mole/liter. In the next sections we shall 
discuss the statistical basis and extensions of the Debye-Hiickel theory to higher 
concentrations. 



VT" x io 3 

Figure 15-3. Activity coefficients of sparingly soluble salts. 
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Figure 15-4. Logarithm of the mean ionic activity coefficient of aqueous solutions of alkali halides at 
25°C. (a) LiCl, (b) NaCl, (c) KC1, (d) RbCl. (Data from R. A. Robinson and R. H. 
Stokes, “Electrolyte Solutions,” 2nd ed., Butterworths: London, 1965.) 


15-2 SOME STATISTICAL MECHANICAL 
THEORIES OF IONIC SOLUTIONS 

In spite of the great success of the Debye-Hiickel theory, when it was originally 
proposed its range of validity was not at all clear. Its success stimulated a great deal 
of work to place the Debye-Hiickel theory within a statistical mechanical framework 
or, in simple terms, to derive it from a partition function. There are two important 
reasons for doing this. One is elucidate just what approximations are needed to derive 
the final Debye-Hiickel equations and to investigate the range of concentrations over 
which these approximations are valid. The other reason is to formulate a method by 
which the approximate theory can be successively corrected. For example, it is not 
obvious how to improve the Debye-Hiickel theory as we have presented in Section 
1 5-1 , but if we could derive it by starting from the configurational integral, for example, 
then we could formally write down correction terms and investigate these. This could 
lead to a series of systematic corrections. 

An appealing approach is to use the virial expansion developed in Chapter 12 (or 
really the McMillan- Mayer theory since this is a solution rather than a gas). We saw 
earlier in this chapter, however, that the virial coefficients diverge for the coulombic 
potential. A great breakthrough along these lines was made by Mayer* in 1950. He 
showed that although individual cluster integrals diverge, it is possible to combine the 
infinite parts of all the virial coefficients B „ , for n > 2, so that they mutually cancel 
and yield a finite result. By a careful evaluation of the graphs from all the virial co- 
efficients, he was able to show that the Debye-Hiickel theory was obtained if only 

* J. E. Mayer, J. Chem . Phys., 18, p. 1426, 1950; see also E. Haga, J. Phys. Soc . Japan , 8, p. 714, 1953. 
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terms in the lowest order in the concentration were kept. A complete discussion of 
Mayer’s approach is too involved to give here, but it is possible to present the general 
idea. 

We start with the equations for an imperfect gas, namely, 

-^ = c+££*c* (15-78) 

AC 1 k>2 

where the virial coefficients are given in terms of cluster integrals 
1 „ 

&k — ^ — Pk-i 


and the cluster integrals are given by 


Pk ~ l ~ (k — 1)! V f I s ' 1 - 2 ‘ " ’ ‘ dlk 

where S[ t 2 , . k IS the sum of all stars (doubly connected diagrams) of k particles. We 
wish to generalize these equations to more than one component, and to this end it is 
convenient to rewrite them in a slightly different form. Substituting Eq. (12-33) into 
Eq. (15-78) gives 


P_ 

kT 


c-E (jL r^^-i ck 

k>2 AC 


We now define B fc by 


A-i 




k 



,kd T 1 “ “ “ d T k 


(1 5-79) 


and so we can write 


l- = c -ZVc-l)B k c k . 

For two components this equation becomes simply* 


P 

kT Cl 


= Cl + C 2 — E (^1 + ^2 — 1) c 1 k ‘c 2 kl 


where 


*i>i 
k 2 > 1 


B 


1 


kik 2 


k x lk 2 \V 


/■■■ J 5 *. 2 


+i, ...,fci+k 2 


dr x “ dr 


ki+k 2 


(1 5-80) 


(1 5-81 ) 


The integrand of B *l*2 is a sum of products of /-functions. Now as Mayer indicated, 
Eq. (15-80) is a multiple sum in which the order of summation has been implicitly 


* J. E. Mayer, “Theory of Real Gases,” in Handbuch der Physik, Vol. 12, edited by S. Fliigge (Berlin: 
Springer-Verlag, 1958). 
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specified, namely, that one sums over all products in the integrand for a specific 
B *l*2 and then over all values of k A and k 2 . As long as convergent results are obtained, 
it is equally legitimate to sum first over all values of k t and k 2 for some specified type 
of product in the integrand, and then sum over all types of products. By doing this in 
a particular way, we will obtain convergent results for the coulomb potential. 

Mayer defines a summation function S by 


S — £ Bfcik 2 c i fclc 2 fc2 


( 1 5 - 82 ) 


fci>i 

k 2 >l 


In terms of S (Problem 15-27) 


p dS dS 

TTr ~ Ci C 2 + & — Cj — — C 2 — 

kT uc | 5c2 


( 15 - 83 ) 


and 


i 35 

-""■'-fc 


( 1 5 - 84 ) 


If we classify all of the graphs generated in the B k,k 2 into various mutually exclusive 
sets, say {/}, then 


s = Y& J) 


( 1 5 - 85 ) 


where S U) is Eq. (15-82) applied only this class of graph. 
To see what types of graphs are generated, consider 


Ul! =™j±z +U « 


u 


er 


( 1 5 - 86 ) 


The factor e ar is a “ convergence factor” to assure that the various integrals to arise 
converge. We are, of course, only interested in the limit a -► 0. Now define 


2 = 


4n 

eicT 


,-ar 


* r) -*Tr 

k(r) = e~ uia)/kT -l 


( 1 5 - 87 ) 


The /-function that occurs in any can be written as 

f tj = ki j+(k i j+ 

= hj + ihj + 1) E XqhMj 1 (i 5-88) 

I>1 I'- 

This is simply an identity (Problem 15-29). If this is substituted into any particular 
cluster diagram, we get an infinite number of terms involving k {j , powers of , and 



SOME STATISTICAL MECHANICAL THEORIES OF IONIC SOLUTIONS 343 


products of kij and powers of g Xj . These may be represented graphically by drawing 
a dotted line for a £-bond between two particles and a solid line for each ^-bond. 
Some examples of the types of graphs that occur are 



Certainly the types of graphs encountered here are more complex than those of im- 
perfect gas theory, but they can, in fact, be classified and summed in an organized way. 
The delicate part of Mayer’s analysis is to classify them according to the power of the 
concentration that they contribute to S. Mayer showed that if only the so-called 
cyclic graphs are included in S (cf. Fig. 15-5), then the Debye-Huckel expressions are 
obtained and that all other types of graphs contribute a higher order in the concen- 
tration. This is too involved to show here, but Problem 15-31 treats the evalua- 
tion of .S (cycIic) . The result is 


^(cyclic) 

I2n 


(1 5-89) 


from which the Debye-Huckel expressions for p\kT and In yj are obtained through 
Eqs. (15-83) and (15-84). 

Equation (15-89) is the leading term in an expansion of S in the concentration 


Cyclic: 0=0 


p(l): (J---0 




Figure 15-5. The cyclic graphs of the Mayer ionic cluster theory which, when included in S, give the 
Debye-Hiickel limiting law. Also included is another set of graphs which Problem 15-30 
shows gives no contribution to S because of electroneutrality. 
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For the primitive model, the expansion has been extended to higher order, and to 
terms in c 2 In c, we have 


k 3 2n 


4 4 

— 3/ x2 K a K l 

a 3 ( Ct + Co) 1 — 

Un 3 V 1 2) 16 n 192 n 2 


[y + ln(3 kq) — 3 ko] 


oo (-\y 

+ 2na 2 Y 1 } 

y= 4 j! (J — 3) (4na) 


Kj- 2 


j + 0(k 5 ) 4- 0(/c 5 In k) (1 5-90) 


where a is the diameter of the hard-sphere core and 


Kl- 2 = * * * §,/2 (Cl9l' + c 2 tfz') (1 5-91 ) 

This k, 2 _ 2 reduces to the Debye-Huckel k 2 when 1 = 2. 

The various thermodynamic functions can be easily obtained from Eq. (15-90). 
Poirier* has made an extensive application of Mayer’s theory to experimental data, 
and the “practical” aspects of the theory have been discussed by Frank and Tsao.f 
It seems to be useful up to concentrations of 0.4 molar for 1-1 electrolytes. We see 
then that Mayer’s theory is an ionic solution analog to the cluster expansion of im- 
perfect gas theory. 

Another approach that suggests itself is the application of the integral equation 
methods of Chapter 13. The central function there is the radial distribution function 
of ions around some central one. Falkenhagen and KelbgJ review much of the work 
done prior to 1959, most of that being done by Falkenhagen and Kirkwood and 
Poirier. § More recently, however, the Percus-Yevick and hypernetted-chain equations 
have found most favor. 

Let us go back to the Ornstein-Zernike equation 
h(r 12 ) =c(r 12 )+pj c(r l3 )h(r 23 ) dt 3 

Both the Percus-Yevick and hypernetted-chain equations are obtained from this by 
assuming a particular relation between the direct correlation function c(r) and the 
radial distribution function. According to Eq. (13-74), at low densities c(r ) = /(r), and 
the direct correlation function of both the Percus-Yevick and hypernetted-chain 
equations reduces to this correct limit at low densities. We shall see that if we substitute 
this expression for c(r) into Eq. (13-55) and linearize the exponentials, we shall get 
the Debye-Huckel potential of mean force. Before doing this, however, we need the 
generalization of Eq. (13-55) for two components, || which is 


M r > r ') = c i/ r > r ') + f c ii( r > x )^(j( x > *•') dx 


(15-92) 


We can immediately obtain the Debye-Huckel result from this be letting 

Cij =fij = e~ eu ‘J - 1 = -p Ui j (1 5-93) 

where u u and writing h u = g u — 1 = —fiWij where w tJ is the potential of 

mean force. We are neglecting the hard-core cutoff since we are interested only in the 


* J. C Poirier, /. Chem . Phys., 21, p. 965, 1953; 21, p. 972, 1953. 

f H. S. Frank and M. Tsao, Ann. Rev . Phys. Chem., 5, p. 43, 1954. 

t H. Falkenhagen and G. Kelbg, in Modern Aspects of Electrochemistry , Vol. II, edited by J. O. M. Bockris 
(New York: Academic, 1959). 

§ J. G. Kirkwood and J. C. Poirier, J. Phys. Chem., 58, p. 591, 1954. 

|| J. L. Lebowitz, Phys. Rev., 133, p. 895 A, 1964. 
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limit of dilute solutions. This same condition also applies to the linearization of the 
exponentials in f(r ) and h(r). This gives 




(1 5 — 94 ) 


It is convenient to write w u as (gigjls)w(r), in which case Eq. (15-94) becomes 
1 1 v" 2 f w ( r ij) 




Now the integral on the right-hand side here is independent of / since we are integrating 
over r, , and so we can write more neatly 




( 1 5 - 95 ) 


This is an integral equation for w(r) and can be solved easily in several ways. The most 
straightforward is to introduce Fourier transforms as we did in going from Eq. (13-55) 
to Eq. (13-57). If we denote the Fourier transform of w(r) by W and that of r -1 by 0, 
then Eq. (15-95) becomes (Problem 15-32) 


W(t) - 


0(t) 


K 2 0(t ) 


( 1 5 - 96 ) 


According to Eq. (12-51), 




Notice that we have introduced e " as a convergence factor. If this is substituted 
into Eq. (15-96), we get 

"W-PT? 

This can be inverted to give w(r) by using Eq. (12-52): 
n\ il2 1 rsin tr . 


_ /2\ ' 1 r 00 t sin tr 

yr/ r J 0 t 2 + tc : 


This integral is elementary, and we find 


w(r)= — 


which, if the factor g^jls is reinserted, gives the Debye-Hiickel expression for the 
potential of mean force. Thus again we find that the Debye-Hiickel theory is an exact 
statistical mechanical result for dilute conditions. This same result can be obtained 
from the Kirkwood and Born-Green- Yvon equations (Problem 15-33). 
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We can readily extend the above argument to include a hard-sphere repulsion of 
the ions. In this case, w(r) — oo for r < a and for r > a is given by Eq. (15-95) but 
with the restriction that |r — r'| cannot be less than a: 

, . 1 k 2 r w(|r — r'|) 

w(r)=--— — — ; dr ' r> a (15-97) 

r 471 J r 

|r-r'| >a 

Kirkwood and Poirier* have derived this equation in quite a different manner. This 
equation is much more difficult to solve than Eq. (15-95), but some general features 
of the solution were extracted by Kirkwood and Poirier. An intriguing result is that 
at a certain concentration, given by the condition Ka > 1.032, w(r) assumes an oscilla- 
tory form, indicating zones of alternating positive and negative charge densities. This 
oscillatory behavior has been found in other formulations of the theory of ionic 
solutions as well.} 

Recently Rasaiah and Friedman} have solved the Percus-Yevick and hypernetted- 
chain equations numerically. Since these equations have found much success in the 
theory of liquids, one expects that this should be a powerful method for concentrated 
solutions. Most of the results have been tabulated for the restrictive primitive model. 
It has been found that the hypernetted-chain equation} is superior to the Percus- 
Yevick equation § at least for a 1-1 aqueous electrolyte at concentrations up to 1.0 M. 
Fortunately, Monte Carlo calculations have been performed by Card and Valleau|| 
to which the integral equation results can be compared. Table 1 5-1 shows a comparison 
of the osmotic coefficient for an aqueous solution of a 1-1 electrolyte calculated from 
the PY equation and HNC equations to the Monte Carlo calculations for an aqueous 
solution of a 1-1 electrolyte. Remember that the value of the pressure calculated from 
the integral equations depends upon whether one uses the so-called pressure equation 
[Eq. (13-23)] or the compressibility equation [Eq. (13-54)]. It can be seen from the 
table that the results from the hypernetted-chain equation are more internally con- 
sistent than those from the Percus-Yevick equation and that the general agreement is 
quite good even up to almost 2 M. 

Figures 15-6 and 15-7 show the osmotic coefficient and the logarithm of the mean 
activity coefficient versus the square root of the ionic strength. The curves labeled 


Table 15-1. A comparison of the solutions of the hypernetted-chain and Percus-Yevick equations to the 
Monte Carlo results for the osmotic coefficient for a restricted primitive model of a 1-1 
electrolyte with a = 4.25 A, e = 78.5, T— 298 L K 


c 

(moles/liter) 

& MC 

^ hnc 

& hnc 

<f>r A 

& PYA 

0.00911 

0.9701 ±0.0008 

0.9703 

0.9705 

0.9703 

0.9705 

0.10376 

0.9445 ±0.0012 

0.9453 

0.9458 

0.9452 

0.9461 

0.42502 

0.9774 ±0.0046 

0.9796 

0.9800 

0.9765 

0.9844 

1.0001 

1.094 ±0.005 

1.0926 

1.0906 

1.0789 

1.1076 

1.9676 

1.346 ±0.009 

1.3514 

1.3404 

1.3114 

1.386 

Source: J. C. 

Rasaiah, D. N. Card, and J. 

P. Valleau, J. Chem. Phys. 9 

56, p. 248, 

1972. 


* J. G. Kirkwood and J. C. Poirier, /. Phys. Chem. , 58, p. 591, 1964. 

t C. W. Outhwaite, /. Chem . Phys. 9 50, p. 2277, 1969; F. H. Stillinger and R. Lovett, /. Chem. Phys. 9 48, 
p. 3858, 1968. 

% J. C. Rasaiah and H. L. Friedman, /. Chem . Phys. 9 48, p. 2742, 1968; 50, p. 3965, 1969; J. C. Rasaiah, 
Chem . Phys. Lett ., 7, p. 260, 1970; /. Chem. Phys. 9 52, p. 704, 1970. 

§ As modified by Allnatt: A. R. Allnatt, Mol. Phys. 9 8, p. 533, 1964. 

|| D. N. Card and J. P. Valleau, /. Chem. Phys. 9 52, p. 6232, 1970. 
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Figure 15-6. The osmotic coefficient <f) for an aqueous 1-1 restricted primitive model electrolyte with 
hard-sphere diameter 4.6 A, obtained in various ways. DHLL denotes the Debye-Hiickel 
limiting law (sum of ring diagrams in cluster theory). DHLL + B 2 denotes the sum of all 
ring diagrams and all two-particle cluster diagrams. HNC denotes the solution of the hyper- 
netted-chain equation, and Ml and M2 denote the mode expansion results. (From D. 
Chandler and H. C. Andersen, J. Chem . Phys., 54, p. 26, 1971.) 



Figure 15-7. The logarithm of the mean ionic activity coefficient, In y± , for an aqueous 1-1 restricted 
primitive model electrolyte with hard-sphere diameter 4.6 A, obtained in various ways. See 
Fig. 15-6. (From D. Chandler and H. C. Andersen, /. Chem. Phys.> 54, p. 26, 1971.) 


DHLL are the Debye-Hiickel limiting law ; those labeled DHLL + B2 are for the sum 
of all ring diagrams (DHLL) and all two-particle cluster diagrams in Mayer’s ionic 
cluster theory; HNC is for the hypemetted chain as modified by Allnatt ;* and the 
designations Ml and M2 will be discussed shortly. The application of integral equation 


* A. R. Allnatt, Mol. Phys., 8, p. 533, 1964. 
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methods to ionic solutions has been very well developed by Rasaiah and Friedman, 
who discuss the numerical techniques involved and present a thorough analysis of the 
accuracy of their solutions. 

Figures 15-6 and 15-7 contain two other curves, labeled Ml and M2 that we have 
not yet discussed. They are the results of a series of three papers by Andersen and 
Chandler* on a novel and powerful expansion for the Helmholtz free energy of a 
classical system. The potential energy of the system is assumed to be composed of two 
parts: a “reference system” potential and a perturbation potential, which is the sum 
of two-particle potentials. This two-particle potential must have a Fourier transform. 
They then introduce collective variables, which are Fourier transforms of the single- 
particle density, and the excess Helmholtz free energy is written as an infinite series 

BA ex ~ 

= ^ = ^o + I>„ (1 5—98) 

y f»= i 

where s/ 0 is the excess free energy of the reference system and the a n are determined 
by the perturbation potential and the distribution functions (two-, three-, and higher- 
particle) of the reference system, which are assumed to be known. The expansion 
[Eq. (15-98)] is called a mode expansion by Andersen and Chandler. 

In the second paper of their mode expansion series, they apply their technique to 
the calculation of the thermodynamic properties of the restricted primitive model of 
a 1-1 electrolyte. In this model the reference potential is a hard core, and the pertur- 
bation potential is coulombic outside the hard core but arbitrary within. Chandler 
and Andersen found that the rate of convergence of the mode expansion depended 
upon what was chosen for the perturbation potential within the hard core (cf. their 
Fig. 2 for examples) but that very rapid convergence could be achieved from simple 
choices. The results of using a x and a 2 in Eq. (15-98) are shown in Figs. 15-6 and 15-7 
as Ml and M2. The equations are not difficult to apply in this case, and the numerical 
results compare quite favorably with the integral equation methods of Rasaiah and 
Freidman. Tables 15-2 and 15-3 show a comparison of the pjode expansion method 
with the hypernetted-chain results. Table 15-2 also shows the convergence of the 
method for the excess Helmholtz free energy. It should be pointed out that the results 
in Tables 15-2 and 15-3 agree quite well with the Monte Carlo calculations of Card 
and Valleau. 

For high values of z Y z 2 e 2 lekTR, the convergence of the mode expansion is not fast, 
i.e., a 2 is not <a x . In the third paper of their mode expansion series, Andersen and 
Chandler describe a criterion for optimizing the rate of convergence of the mode ex- 
pansion. This criterion ensures that the distribution functions vanish in the physically 
inaccessible region within the hard core. The series [Eq. (15-98)] truncated at n = 1 
is called the random-phase approximation (RPA), and when the Andersen-Chandler 
optimization is applied, it is called the optimized random-phase approximation 
(ORPA). A remarkable result of this optimization criterion is that ORPA is equivalent 
to another theory, the mean spherical model, if the hard-core reference system is 
approximated by the Percus-Yevick theory. The mean spherical model (MSM) was 
applied to ionic solution theory by Waisman and Lebowitzt and is discussed in Prob- 
lems 15-37 through 15-39. The addition of the n = 2 term in Eq. (15-98) then repre- 
sents an improvement over MSM and is now referred to as the MEX theory. 

* H. C. Andersen and D. Chandler, /. Chem.Phys ., 53, p. 547, 1970; 54, p. 26, 1971; 55, p. 1497, 1971. 
f E. Waisman and J. L. Lebowitz, /. Chem.Phys. ,52, p. 4307, 1970; 56, p. 3086, 1972; 56, p. 3093, 1972. 
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Table 15-4 compares the thermodynamic properties of the restricted primitive 
model of a 1-1 electrolyte calculated by the MEX theory to the Monte Carlo data of 
Card and Valleau. 


Table 15-2. Excess free energy of a model Ionic solution 


/* * * § 

s/olc 

Ol/C 

a 2 /c 

^/ct 

HNC s/kt 

difference 

0.05 

-0.0247 

0.2291 

-0.0157 

0.2587 



0.1 

-0.0495 

0.3990 

-0.0249 

0.3247 

0.3262 

-0.0015 

0.2 

-0.0998 

0.5225 

-0.0362 

0.3866 

0.3854 

+0.0012 

0.3 

-0.1508 

0.6055 

-0.0431 

0.4117 

0.4078 

+0.0039 

0.5 

-0.2553 

0.7203 

-0.0506 

0.4143 

0.408 

+0.006 

0.7 

-0.3635 

0.8011 

-0.0541 

0.3837 

0.376 

+0.008 

0.8 

-0.4185 

0.8341 

-0.0551 

0.3605 

0.352 

+0.009 

0.9 

-0.4747 

0.8636 

-0.0556 

0.3333 

0.328 

+0.005 

1.0 

-0.5318 

0.8902 

-0.0558 

0.3025 

0.298 

+0.005 


* Ionic strength or concentration in moles per liter. 

f These numbers are the sum of reference system, one-mode, and two-mode contributions. They contain 
computational errors in the fourth decimal place. 

\ Calculated from osmotic coefficients and activity coefficients obtained from solutions of the hypernetted- 
chain equation. 

Source : D. Chandler and H. C. Andersen, /. Chem . Phys., 54, p. 26, 1971. 


Table 15-3. Mean ionic activity coefficient of a model ionic solution 


7* 

lny±t 

HNC In y ± t 

difference 


-0.2104 


-0.0003 

0.2 



-0.0029 

0.3 

-0.2275 

-0.2221 

-0.0054 

0.5 

-0.1854 

-0.1776 

-0.0078 

0.7 

-0.1186 

-0.1099 

-0.0087 

0.8 

-0.0789 

-0.0704 

-0.0085 

0.9 

-0.0358 

-0.0277 

-0.0081 

1.0 

+0.0104 

+0.0181 

-0.0077 


* Ionic strength or concentration in moles per liter. 

f These results are the sum of reference system, one-mode, and two-mode contributions. They contain 
computational errors in the fourth decimal place. 

\ Obtained from solutions of the hypernetted-chain equation. 

Source: D. Chandler and H. C. Andersen, /. Chem. Phys., 54, p. 26, 1971. 


Table 15-4. Thermodynamic properties of the restricted primitive model of a 1-1 electrolyte solution: 
T= 298.0°K, e = 78.5, and d= 4.25 A 


m 

h* 

^Mct 

(In y±ht 

(In t±)mc§ 

(~E/2NkT) 2 1 

(-£/2A%T)mc# 

0.00911 

0.9706 

0.9701+0.0008 

-0.0957 

-0.0973 

0.0993 

0.1029 ±0.0013 

0.10376 

0.9451 

0.9445 + 0.0012 

-0.2273 

-0.2311 

0.2678 

0.2739 + 0.0014 

0.42502 

0.9786 

0.9774+0.0046 

-0.2587 

-0.2643 

0.4285 

0.4341+0.0017 

1.0001 

1.0906 

1.094 ±0.005 

-0.1263 

-0.1265 

0.5472 

0.5516 + 0.0016 

1.9676 

1.3426 

1.346 ±0.009 

+0.2587 

+0.254 

0.6519 

0.6511+0.0020 


* The mode expansion osmotic coefficient, including the two-mode contribution, 

t Monte Carlo osmotic coefficient. 

X The mode expansion logarithm of the mean ionic activity coefficient, including the two-mode contribution. 

§ The Monte Carlo result. The standard error for these results are only slightly larger than those for the 
osmotic coefficient at the same concentration. 

|] Mode expansion result, including the two-mode contribution, for the negative reduced excess internal 
energy per ion. 

# The Monte Carlo result, with standard errors. 

Source: H. C. Andersen and D. Chandler, /. Chem. Phys., 55, p. 1497, 1971. 
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In a second series of three papers,* Andersen and Chandler further investigate the 
concept of the appropriate choice of perturbation potential. They start by transforming 
the usual Mayer diagrammatic expansion for the Helmholtz free energy and pair cor- 
relation function of a multicomponent fluid using certain topological reduction tech- 
niques that are beyond the level of this book. The result of this analysis is that the 
diagrams are expressed in terms of a so-called renormalized potential #(r) instead of 
u(r ) itself. This renormalized potential is defined as a sum of certain kinds of diagrams 
and, when evaluated for a 1-1 restricted primitive electrolyte, turns out to be 

%j<r) = J *"*" #*> A 0 5 -"> 

where t(k) = — Pti(k)l[l + 2ficti(k)\ and fi(k) is the Fourier transform of the optimized 
potential, which is coulombic outside the hard core and chosen such that g(r) = 0 
within. The expansion of the excess Helmholtz free energy is 

s/ = «^orpa + B 2 + S (1 5-1 00) 

where 

B 2=il Pi Pj J dr {(«*“ - 1 - V U )0 U 0 - Wij 2 } 

They also show that the radial distribution function can be written as 

9 ij(r) = g^ir )expft?y(r )] + 5' (1 5-1 01 ) 

In Eqs. (15-100) and (15-101) S and S' are the sum of all the diagrams not included 
in the first terms. 

When the functional optimization described before is applied, the result is that the 
contributions of S and S' are negligible. Equations (15-100) and (15-101) with S and 
S' equal to zero and the potential optimized are referred to as the ORPA + B 2 and 


Table 15-5. Thermodynamic properties for the restricted primitive model for zy~z 2 electrolyte solu- 
tions: T- 298.16°K, e = 78.358, and rf- 4.2 A 


charge 

type 



- AE/NkT 



4> 


c„ 

ORPA 4- B 2 

HNC 

MSM 

ORPA + B 2 

HNC 

MSM 

2-1 

0.005 

(±0.001) 

0.265 

0.2631 

0.2479 

(±0.001) 

0.923 

0.9239 

0.9248 

2-1 

0.05 

0.644 

0.6324 

0.6130 

0.847 

0.8502 

0.8466 

2-1 

0.2 

0.973 

0.9492 

0.9489 

0.830 

0.8358 

0.8299 

2-1 

1.0 

1.432 

1.3870 

1.4059 

1.041 

1.0475 

1.0615 

2-1 

1.3333 

1.523 

1.4730 

1.4899 

1.167 

1.1778 

1.1994 

3-1 

0.005 

0.585 

0.5638 

0.4981 

0.849 

0.8530 

0.8520 

3-1 

0.01 

0.759 

0.7279 

0.6568 

0.812 

0.8186 

0.8130 

3-1 

0.1 

1.486 

1.4178 

1.4233 

0.708 

0.7242 

0.6960 

3-1 

0.5 

2.124 

2.0023 

2.1088 

0.788 

0.8065 

0.7981 

3-1 

1.0 

2.433 

2.2830 

2.4113 

1011 

1.0264 

1.0544 

2-2 

0.005 

1.055 

0.8158 

0.5607 

0.785 

0.8077 

0.8294 

2-2 

0.0625 

2.062 

1.737 

1.4554 

0.643 

0.6452 

0.6303 

2-2 

0.5625 

2.916 

2.757 

2.6438 

9.595 

0.6028 

0.5732 

2-2 

2.0 

3.579 

3.396 

3.3799 

0.857 

0.8865 

0.9112 


Source: S. Hudson and H. C. Andersen,/. Chem. Phys ., 60, p. 2188, 1974. 


* H. C. Andersen and D. Chandler, /. Chem . Phys ., 57, p. 1918, 1972; 57, p. 1930, 1972; 57, p. 2626, 1972. 
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exponential approximation (EXP), respectively. These approximations have been ap- 
plied to z x -z 2 electrolytes by Hudson and Andersen,* and the results are shown in 
Table 15-5 where it can be seen that the agreement among the various theories is quite 
good. This is some indication, however, that the agreement does begin to break down 
as z y and z 2 increase. All of the recent advances described in this section are discussed 
very well by Rasaiah (in “Additional Reading”). 

ADDITIONAL READING 

Falkenhagen, H. 1934. Electrolytes . London: Oxford University Press. 

, and Kelbg, G. Mod. Aspects Electrochem., 2, p. 1, 1959. 

Fowler, R. H., and Guggenheim, E. A. 1956. Statistical thermodynamics. Cambridge: Cambridge 
University Press. 

Friedman, H. L. 1962. Ionic solution theory . New York: Interscience. 

. Mod. Aspects Electrochem., 6, p. 1, 1970. 

— . J. Solution Chem. 9 1, p. 387, 1972. 

Guggenheim, E. A., and Stokes, R. H. 1969. Equilibrium properties of aqueous solutions of single 
strong electrolytes. New York: Pergamon. 

Olivares, W., and McQuarrie, D. A. Biophys. 15, p. 143, 1975. 

Rasaiah, J. C. J. Solution Chem. t 2, p. 301, 1973. 

Resibois, P. M. V. 1968. Electrolyte theory. New York: Harper & Row. 

Rice, S. A., and Nagasawa, M. 1961. Polyelectrolyte solutions. New York: Academic. 

Robinson, R. A., and Stokes, R. H. 1965. Electrolyte solutions. 2nd. ed. New York: Academic. 


PROBLEMS 

15-1. Show that the second virial coefficient diverges for a coulombic potential. 

15-2. Show that Eq. (15-7) is a solution to Poisson’s equation. To do this you need to 
know that V 2 (l/r) = 8(r). 

15-3. Using the divergence theorem from vector calculus, derive Poisson’s equation from 
Gauss’ law. 

15-4. Use Gauss’ law to show that the electric field due to an infinite uniformly charged 
plane is 47 to, where <j is the surface charge density. 

15-5. Calculate the electric field within a sphere bounded by a uniformly charged surface. 

15-6. Use Gauss’ law to show that the normal component of D = eE on crossing a charged 
surface has a discontinuity equal to 47 t<t, where a is the surface charge density. 

15-7. Use Gauss’ law to show that the electric field due to an infinite uniformly charged 
line of linear charge density e is lejr. What does this say about the potential? Is there any 
difficulty in choosing the zero of energy? 

15-8. Derive Eq. (15-16), i.e., that <^> =(dAldqj). 

15-9. Show that the Debye-Hiickel differential equation, Eq. (15-32), 



can be written as 


1 d 2 


This is a standard substitution in handling a spherically symmetric Laplacian operator. 


* S. Hudson and H. C. Andersen, /. Chem. Phys. t 60, p. 2188, 1974. 



352 SOLUTIONS OF STRONG ELECTROLYTES 


15-10. The charge density around the yth central ion in the Debye-Hiickel theory is 
nj(r) = — 


zjeK 2 e K<r a) 


47rr(l + tea) 

Find the most probable value and the mean value of r in terms of /c. 

15-11. Calculate the mean thickness of the Debye-Hiickel ionic atmosphere and ionic 
strength of a 0.1 and 0.01 M solution of a 1-1 electrolyte in (a) water at 25°C (e = 78), and 
(b) methanol at 25°C (e = 31.5). 

15-12. Derive Eq. (15-55). 

15-13. Show that for small kq 9 t(ko) = 1 — \ko H . 

15-14. Derive Eq. (15-59) from a generalization of Eq. (13-34). 

15-15. Show that J0 p* 1 = — K 3 a(Ka)/24n 9 where o(ko) is given by Eq. (15-63). 

15-16. Show that o(k a) in the previous problem becomes 1 — f/cfl for small ko. 

15-17. Show that the electroneutrality condition allows one to write 

v+q + 2 + v-q - 2 

Use this to derive Eq. (15-77) from Eq. (15-76). 

15-18. Show that the Debye-Hiickel expression for <^j>, i.e., Eq. (15-43), satisfies the 
symmetry condition of Eq. (15-47). 

15-19. Show that k~ 1 = 3.04c“ 1/2 A for a 1-1 aqueous electrolyte solution at 25°C when 
c is the concentration of salt expressed in moles per liter. 

15-20. Show that the linearization of the exponents that leads to the Debye-Hiickel 
equation is valid for concentrations less than around 0.01 M for a 1-1 aqueous electrolyte 
solution. 

15-21. The mean molar activity coefficients of NaCl at 25°C are 


0.001 M 
0.966 


0.005 M 
0.929 


0.01 M 
0.904 


0.05 M 
0.823 


0.1 M 
0.778 


0.5 M 
0.682 


1.0 M 
0.685 


2.0 M 
0.671 


Calculate y± according to the Debye-Hiickel theory and compare the result to the above 
data. 

15-22. Using the Debye-Hiickel theory, calculate the mean ionic activity coefficient of a 

1- 1 aqueous electrolyte solution at 25 C as a function of concentration. Do the same for a 

2- 2 electrolyte. Compare the results to those in Table 15-3. 

15-23. Calculate the corresponding entries in Table 15-4 for the Debye-Hiickel theory, 
with and without the ko terms. 

15-24. Show that the solution to the Debye-Hiickel differential equation for the case in 
which the charge q of the central ion is uniformly distributed over its surface is 


<r <a 


q I Kd \ 

^ r)= TA x -T^r - 1 0 

ea\ 1 + kq] 

Using the equation 

^.. = fV<?w 

Jo 

for the electrostatic free energy of the central ion, show that 



PROBLEMS 353 


Now show that if the fixed central ion has radius b and it is immersed in a solution of mobile 
ions whose centers can approach the center of the sphere to within a distance a (i.e., a — b 4- 
the radius of the mobile ions), then 



Show that this result is independent of the dielectric constant of the central ion (see Tanford 
in “Additional Reading”). 


15-25. Show that the radial distribution functions of the Debye-Hiickel theory are given 
by 


fftAr) = 1 


QtQj e— 
ekT r 


(in the limiting law limit of ko -► 0). Interpret this equation. Show how this is consistent with 
the expression for the net charge density in the ionic atmosphere, Eq. (15-51). 

15-26. Derive expressions for all the thermodynamic properties of an ionic solution from 
the Debye-Hiickel radial distribution functions given in Problem 15-25 and 


3) = P 2 C ‘ C J f SiMu^r^-rrr 1 dr 
l,J= 1 J c u 


where 


Uij= co 

QiQj 

er 


a u 
r <a tJ 

r>a ti 


Hint: Recall the Gibbs-Helmholtz equation for A in terms of a temperature integration 
over J8. 

15-27. Show that in terms of the summation function S defined by Eq. (15-82), the 
equation state is given by 


p SS 

— Ci + c 2 + S ci — C 2 
kT dci 


as 

dc 2 


15-28. Derive expressions for the osmotic coefficient <f> and the logarithm of the activity 
coefficient In y± in terms of si = —p A ex /V. 

15-29. Show the Eq. (15-88) for f tj is simply an identity. 

15-30. Show that the summation over the set of graphs labeled by p{ 1) in Fig. 15-5 gives 
no contribution to the thermodynamic properties because of electroneutrality. 

15-31. Show that the contribution to S due to the cyclic graphs is 

fC 3 

O(cycllc) — 

1277- 

15-32. Show that the Fourier transform of Eq. (15-95) is given by Eq. (15-96). 

15-33. Use either the Kirkwood or Bom-Green-Yvon equation to show that the Debye- 
Hiickel theory is an exact limiting law. 

15-34. An apparently excellent approximation for the interionic radial distribution 
functions is to write 

g,j(r)=g,j HS (r)e-* t,H "> kT 

where 

QiQ je-« r - mi 


<f> DH (r ) 


er(l + ko) 
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Using this approximation in the case in which all the ions have the same size, calculate J8 E 
and all the other thermodynamic properties of a 1-1 electrolyte by the method outlined in 
Problem 15-26. Compare your results to Tables 15-3 and 15-4. Use the Grundke-Henderson 
program for # HS (r) given in Appendix D.* 

15-35. When an electrode or any charged surface is immersed in an ionic solution, an 
ionic atmosphere is formed next to the electrode. Since the net charge of the ionic atmosphere 
is opposite that of the electrode, one can picture the electrode and its ionic atmosphere as 
two oppositely charged planes separated by a distance around l//c. Hence this is called a 
diffuse double layer and occurs whenever a large charged body is immersed in an ionic 
solution. The theory of the diffuse double layer was given by Gouy and Chapman and is 
now called the Gouy-Chapman theory. They start with the nonlinear Poisson-Boltzmann 
equation, which in this case depends only upon one variable x 9 the distance from the charged 
planar surface. If we let the potential be <j> 9 then we have 


d 2 <f> 

fa 2 


— Z c *<I' e 




Show that for the case of a symmetrical binary electrolyte, i.e., one in which q+ — — q_ = q 9 
this equation becomes 


d 2 <f> 

dx 2 


8 irqc 
e 


sinhQS q</>) 


Let <j>* = fiq<j> and k 2 %Trq 2 cjekT 9 and write this equation in the reduced form 


d 2 6 * 

sinh 4>* 


where £ kx. Using the identity 


2 dx \dxj \dx) \dx 2 ) 


integrate this equation once to get 


ckf>* 

- -(2 cosh </>* - 2) ,/2 - 2 sinh 

where the fact that <j> and d<j>jdx^ 0 as x-> oo has been used. If we let the electrode have a 
fixed potential <j > 0 , integrate once more to get 



e < f >*/2 


A^+l+^V 72 -!)*-* 


This is called the Gouy-Chapman equation. Plot <j> versus x and show that <j> decreases 
roughly exponentially as a function of x. Show that if the Poisson-Boltzmann is linearized 
by replacing sinh(£ q<f>) by f$q<f>, then 
</>(x) ^<f>oe~ KX 

Compare this to the Gouy-Chapman equation as a function of concentration. (The work 
by Verwey and Overbeek in “Additional Reading” is the classic reference for the Gouy- 
Chapman theory.) 

15-36. By starting with the fact that the total charge surrounding a central ion must be 
equal in magnitude but of opposite sign to charge on the central ion, i.e., with 

f°° 

Cj\ giAryhrr 1 dr - q, 

i J 0 


* See W. Olivares and D. A. McQuarrie, Biophys. J ., 15, p. 143, 1975. 
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show that 


2 Qi c * Qi C J dr = 

t.i J o 


EkTK 2 

4tt 


is an expression of the condition of electroneutrality. This is an interesting way to write this 
equation since Stillinger and Lovett* have derived the additional condition 


1,‘ltctgjcj 


f SiAr^r* 
Jo 


dr = 


3 ekT 
2tt 


This is now called the second-moment condition. The condition of electroneutrality can be 
thought of as the zero-moment condition. Show that gu DH (r ) satisfies both moment conditions. 

15-37. In 1966, Lebowitz and Percusf derived an approximate integral equation, which 
applied to simple liquids or ionic solutions, is now called the mean spherical model and 
consists of the conditions 


0i Jf) — 0 r<R (J 

ct/jr) - —P w,//*) r > R tJ 

along with the Ornstein-Zemicke equation. Note that the first condition is exact for a potential 
with a hard-core repulsive part (such as in the primitive model of electrolytes) and that the 
approximation consists of setting c t j(r) = — Put/j) outside of the hard-core diameter. Using 
the Ornstein-Zemicke equation, one can then solve for gtj{r) outside of R tJ and Ci/r) inside 
Rij. For a strictly hard-sphere potential, u tJ — 0 for r>R tJ and so c tJ (f) = 0 there also. 
Show that in this case, the mean spherical model is the same as the Percus-Yevick equation 
for hard spheres. Since the mean spherical model, then, is an extension of the hard-sphere 
Percus-Yevick equation, it is not surprising that it can be solved analytically for certain 
u t /r), and Waisman and LebowitzJ have solved it for the restricted primitive model. They 
find that for a 1-1 electrolyte that 

CiAr) - c tJ \r) - r<R 

fahVj ^ „ 

= r > R 

er 


where c u °(r) is the direct correlation function of a system of uncharged hard spheres. R is 
the diameter of the ions (restricted primitive model), and 

B = x- 2 [x 2 + x-x( 1 -h 2 *) ,/2 ] 
where 



Waisman and Lebowitz§ show that c tJ (r ) can be used directly to give 


E' x (x, P) = 


x 2 + x — x(l + 2*) 1/2 
4t tPR* 


* /. Chem. Phys ., 49, p. 1991, 1968. 
t Phys, Rev,, 144, p. 251, 1966. 

t J. Chem, Phys., 52, p. 4307, 1970; 56, p. 3086, 1972 ; 56, p. 3093, 1972. 
§/. Chem. Phys., 56, p. 3086, 1972. 
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Using the Gibbs-Helmholtz equation, show that 
(A - A 0 ) p* 1 

= - (\ 2 ttR *)-\ 6 x + 3 x 2 ± 2 - 2(1 ± 2 *) 3 ' 2 ] 
Show from this that the osmotic coefficient is 


4 >e - tf >° ± [ 4 t rCc, + c 2 )/? 3 ]- 1 [ a : + x(l + 2*) 1 ' 2 - §(1 + 2*) 3 ' 2 + §] 

where the subscript E indicates that <f> is calculated through the equation for E cx (x, £). 
Show also that the logarithm of the mean activity coefficient is 


In y± eI = 


x(l + 2 x ) 1 / 2 - x - x : 


4tt(ci -f c 2 )R* J 

Table 15-6 gives a comparison of this equation to several other approaches. 


Table 15-6. Comparison of the osmotic coefficient calculated by various theories* 


c 

(moles/liter) 

& mc 

^ hnc 

& hnc 

4>J YA 

<f>c rVA 

A msm 

<pv 

4>^ M 

^MEX 

0.00911 

0.9701 ± 0.0008 

0.9703 

0.9705 

0.9703 

0.9705 

0.9687 

0.9709 

0.9707 

0.10376 

0.9445 ±0.0012 

0.9453 

0.9458 

0.9452 

0.9461 

0.9312 

0.9454 

0.9452 

0.42502 

0.9774 ±0.0046 

0.9796 

0.9800 

0.9765 

0.9844 

0.9446 

0.9806 

0.9787 

1.0001 

1.094 ±0.005 

1.0926 

1.0906 

1.0789 

1.1076 

1.039 

1.097 

1.091 

1.9676 

1.346 ±0.009 

1.3514 

1.3404 

1.3114 

1.386f 

1.2757 

1.3595 

1.342f 


* For these calculations, a= 4.25 A, e= 78.5, and T— 298 C K. 
t There is some uncertainty in the last digit. 


Source: J. C. Rasaiah, /. Solution Chern., 2, p. 301, 1973. 


The mean spherical model has been solved analytically for fluids of hard spheres with 
permanent electric dipole moments by Wertheim* and for a mixture of such particles by 
Adelman and Deutch.f 

15-38. Plot In y ± el of the mean spherical model (see previous problem) as a function of 
x = (87re 2 c)S/? 2 /6) 1/2 for an aqueous solution of 1-1 electrolyte at 25 C C. Plot on the same 
graph In y± DH and In y± DHLL , where DH denotes Debye-Hiickel theory with the kq terms 
retained. 

15-39. Using the analytic expression for In y ± el in the mean spherical model (Problem 
15 - 37 ), show that in the limit jc -*0 one obtains the Debye-Hiickel limiting law. Note that 
In y ± el =ln y ± — lny 1 * is the electrostatic contribution to the logarithm of the activity 
coefficient. 


* /. Chem. Phys ., 55, p. 4291, 1971. 
t /. Chem, Phys., 59, p. 3971, 1973. 




CHAPTER 16 


KINETIC THEORY 
OF GASES AND 
MOLECULAR COLLISIONS 


In the first 15 chapters we have discussed only systems that are in thermodynamic 
equilibrium. We started with simple systems such as ideal gases, where the intermolec- 
ular potential can be neglected, and progressed to more complicated systems where 
the intermolecular potential plays a key role. In the next seven chapters we shall study 
the molecular theory of transport through gases and liquids. This is now a very active 
area of research, and it has been only in the 1960s that this field has been set on a basis 
comparable to equilibrium statistical mechanics. There are still a number of tricky 
conceptual problems that rise up now and then, but on the whole and within the 
level of this book, nonequilibrium or irreversible statistical mechanics is only slightly 
more complicated than equilibrium statistical mechanics (statistical thermodynamics). 

In this chapter we shall first review the elementary kinetic theory of gases and 
molecular collisions. In the next chapter we shall discuss the macroscopic equations 
of continuum mechanics or hydrodynamics, such as the continuity equation, the 
macroscopic momentum balance equation, and energy balance equation. This is essen- 
tially the “thermodynamic 55 background for nonequilibrium statistical mechanics. 
In the next chapter we shall review the derivation of the most fundamental equation 
of statistical mechanics, namely, the Liouville equation, and then introduce the various 
distribution functions and flux vectors. Then we shall derive the fundamental equation 
of change of dilute gases, the Boltzmann equation. We shall discuss some of its direct 
consequences, and then the standard method of solving this integrodifferential equa- 
tion, the so-called Chapman-Enskog method. Then we shall present the main results 
of the Chapman-Enskog theory and make an extensive comparison to experimental 
data. In the remaining chapters we shall study several approaches to the transport 
theory of liquids ending with what is probably the most successful and well accepted 
to date, the time-correlation-function method. Before going on to study these various 
theories of transport in gases and liquids, however, let us review some elementary 
kinetic theory of gases, since that is what we are about to generalize and put into a 
general statistical mechanical framework. 
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16-1 ELEMENTARY KINETIC THEORY OF TRANSPORT IN GASES 

In this section we shall discuss an elementary kinetic theory treatment of transport 
phenomena in dilute gases. We shall derive expressions for the principal thermal 
transport coefficients, namely, the coefficients of diffusion, viscosity, and thermal con- 
ductivity. The kinetic theory of transport in gases can be approached from a number 
of levels, varying from very simple treatments, in which all the molecules are assumed 
to have the same (average) velocity and move only in the x- 9 y- 9 and z-directions, to 
very sophisticated treatments which make no unnecessary assumptions. An interesting 
thing is that the resu Its of these various theories differ only by constant factors of the 
order of unity. One can generate pages of algebra to get a more exact equation which 
may have the same form as the simple equation but introduce a missing factor like $. 
In order to introduce the basic ideas of transport in gases, we shall use a treatment 
intermediate in rigor. There are many texts dealing with the kinetic theory of gases. 
Several of these are given in the “Additional Readings.” 

Recall from Chapter 7 that the distribution of molecular velocities in a one-com- 
ponent gas of molecules with mass m is given by the Maxwell-Boltzmann distribution: 

f(v x , v y , v z ) dv x dv y dv z = ex p{ ~ 2 If ^ + Vy + dVx dVy dVz 

or 

m * . 4.^)" »» expj- * (16-1) 

where v 2 = v 2 + v 2 + v 2 . Some average values derived from these distributions are 
(cf. Problems 7-15 through 7-24) 



(v 2 ) 1/2 = root-mean-square velocity = 


3 kT \ 1/2 


(*) 


( 16 - 2 ) 


Note that the values of v 2 9 v 2 y v 2 9 and v 2 are the results expected from equipartition 
(Chapter 7). 

We now introduce the idea of the mean free path. In general, the average distance 
traveled by a molecule between collisions cannot be well defined since a collision itself 
cannot be unambiguously defined. We can get a good idea of this quantity, however, 
if we use rigid spheres as a guide. Consider a molecule with diameter c moving in the 
jc-direction. Since there will be a collision if the center of our one molecule comes 
within a distance u of the center of one of the other molecules in the gas, each of these 
molecules presents a target of effective diameter 2 a 9 and hence of area no 2 . This is 
shown in Fig. 16-1. The number of such targets in a plane of unit area perpendicular 
to the ^-direction and of thickness dx is p dx 9 where p is the number density of mole- 
cules in the gas. Neglecting overlap, the total target area presented by these molecules 
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Figure 16-1 . An illustration of why the effective collision diameter is 2 o instead of just cr itself. 

is na 2 p dx. The probability that our one molecule will suffer a collision is then the 
ratio of this area to the total area (1 cm 2 ): 

probability of a collision = na 2 p dx (1 6-3) 

Now consider a beam of n 0 molecules with approximately equal velocities in the 
positive x-direction. Let them all start at x = 0. The number of molecules that undergo 
a collision between x and x 4- dx is the number of molecules reaching x , nipc), multi- 
plied by the probability of a collision in dx, nap dx. We can write, then, 

n(x + dx) — n(x) = — na 2 pn(x) dx 


or 


dn 

dx 


= —na 2 pn(x) 


whose solution is 


n(x) — n 0 e n ° 2px (1 6-4) 

The mean distance traveled is obtained by multiplying the distance x by dn(x), the 
number of molecules colliding between x and x 4- dx, integrating over all values of x, 
and then dividing by the total number of molecules. If we denote the mean free path 
by /, then 

, If" 0 , 1 r 00 dn _ 1 

Z = — x dn = x— dx = — =— (16-5) 

n 0 Jo n o J o dx na*p 

which is the result we were after. This differs by a numerical factor of order unity from 
a more rigorous derivation, and we shall see in Section 16-3 that the correct expression 
for / is \/2 1/2 na 2 p. 

We can get a feel for the order of magnitude of the mean free path by using the ideal 
gas equation to eliminate p, giving 


/ 


kT 

na 2 p 


( 1 6 - 6 ) 


At 300°K and for a = 5.0 A, / = 5 x 10“ 6 cm for a pressure of one atmosphere and 
4 x 10“ 3 cm for a pressure of 1 mm Hg. The average time between collisions is ob- 
tained by dividing / by the average velocity. At standard conditions, this number is 
approximately 10“ 10 sec, i.e., one molecule of a gas at standard conditions undergoes 
about 10 10 collisions per second. 
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We shall now derive expressions for the coefficients of viscosity, thermal conductivity, 
and diffusion for a dilute gas. We shall derive all of these quantities by considering 
the transport of some molecular quantity from one region to another. Viscosity in- 
volves the transport of momentum; diffusion involves the transport of mass; and ther- 
mal conductivity involves the transport of energy. We shall derive these quantities 
somewhat nonrigorously, but our final results will be correct to within numerical fac- 
tors of order unity. In later chapters we shall rederive these results rigorously. The 
derivation we use is called the mean free path theory of transport since the idea of the 
mean free path will be of central importance. Let us consider viscosity first. 

The usual mechanical setup in an experimental determination of the viscosity of 
gases is a pair of parallel plates a fixed distance a from each other. This is shown in 
Fig. 16-2. Let the z-axis be perpendicular to these plates, with one plate located at 
z = 0 and one at z = a. The lower plate is kept at rest, and the upper one is moving 
with a uniform velocity U in the x-direction. If the distance between the plates is large 
compared to the mean free path, the layer of gas next to the plates assumes the same 
velocity as the plates themselves. Because of this, the gas has v x = f/atz = a and v x = 0 
at z = 0. The average x-component of the velocity between the plates is denoted by 
i/(z). We now assume that i/(z) varies linearly with z so that u(z) = Uzja . The average 
momentum at the height z will be 


G(z) = 


mUz 

a 


(16-7) 


where m is the mass of a molecule. 

Although equally many molecules from above and below reach the height z per 
unit time, the ones coming from above will, on the average, carry more momentum 
than the ones from below. Consequently there will be a downward flow of momentum 
through any horizontal plane. Let ^(z) be this flow per square centimeter per second. 

The number of molecules per unit time with velocity v = (v x , v y , v 2 ) passing through 
a unit horizontal plane located at z is equal to the number of molecules initially located 
within a parallelepiped with a 1 cm 2 base and a height | v 2 | . The volume is then | v 2 \ . 
The average number of molecules in this parallelepiped is | v z \pf(v x , v y , v 2 ). If v 2 is 
positive, the particles come from below, and if v 2 is negative, they come from above. 
Since a +v 2 occurs as often as a — v 2 , the net flow through the surface is zero, indica- 
ting a steady state. 

The molecules passing through the unit horizontal plane at z have traveled in a 
straight path since undergoing their last collision and, on the average, have traveled 
a distance /. The last collision of a molecule with velocity v, therefore, occurred at a 
height z' = z — v 2 ijv . Note that if v 2 is positive, the last collision occurs from below, 
and if v 2 is negative, it occurs from above. 

We now make the assumption that during the last collision the molecule has come 
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Figure 16-2. The geometry involved In the experimental setup used to measure the viscosity of a gas. 
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into local equilibrium with its environment and so carries the momentum associated 
with that height, i.e., z . This momentum is 

G(z') = g(z~— \ = G(z)~ — ^+"- (16-8) 

\ v J v dz 

where we have expanded G(z') about G(z). The net flow of momentum G is given by 
integrating the product of G(z') and the number of molecules crossing the surface from 
the height z' over all velocities: 

oo 

= JJJ G(z')v z pf(v x , v y , v z ) dv x dv y dv z (1 6-9) 

-oo 

Substituting Eq. (16-8) for G(z') into this gives the sum of two integrals, the first of 
which is zero and the second is 


i]/(z) = f(v x > Vy . v z ) dv x dv y dv 2 


( 16 - 10 ) 


Since there is nonzero average velocity in the ^-direction, the distribution of molecular 
velocities in this integral is given by 

f(v x , v y , v z ) = exp J - ~ [(p, - u(z)) 2 + v y 2 + v 2 ] J 

If u is small compared to the molecular velocities, however, we can simply ignore the 
i/(z) in the exponent to get 

dG 

il/(z)= -$lpv— ( 16 - 11 ) 


The minus sign indicates that the flow is in the direction from higher G to lower G . 
Recall now that G(z) = milz/a. Substituting this into Eq. (16-11) gives 

\J/ = —\pmlv — ( 1 6 - 1 2 ) 

a 

which we can write as 

= —%pmlv grad u (1 6-1 3) 

Since ip represents the rate of momentum transfer, it is really the viscous force acting 
upon the upper plate. The frictional force per unit area of surface acting on the upper 
plate is usually given by the macroscopic law (Newton’s law of viscous flow) 

F = —rj grad u (1 6-1 4) 

where rj is just a proportionality constant called the coefficient of viscosity. Comparison 
of Eqs. (16-13) and (16-14) gives 


r\ = %pmlv ( 16 - 15 ) 

Using Eq. (16-5) for / and Eq. (16-2) for v gives 
1 /2\ 3/2 0 mkT Y 12 

,a 3W 


( 16 - 16 ) 
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This equation predicts that the coefficient of viscosity is independent of the density, 
or pressure, and is a function of temperature only. This result was first predicted by 
Maxwell and has since been demonstrated experimentally over a wide range of 
pressures. 

Notice that we did not need to specify the form of G(z) to derive Eq. (16-11). We 
can consider \J/(z) to be the flow of some quantity G(z) per second per unit area through 
a plane parallel to the xy plane located at a height z. When G(z) was specified to the 
momentum, ^(z) became the momentum transfer or drag on the moving plate. Suppose 
then that both plates are stationary, but now there is a temperature gradient because 
the plates are fixed at different temperatures. The average energy per molecule e will 
vary with height, i.e., 

G(z) = a(z) (16-17) 


dG 

dz 


dedT 

dT~dz 


-l 8radr 


(1 6-1 8) 


where N 0 is Avogadro’s number and C v is the molar heat capacity. Substitute this 
into Eq. (16-11) to get the heat flow per unit area through a horizontal plane 


'Kz) = ~ipl irr v grad T 
No 


(1 6-1 9) 


The macroscopic law here is Fourier’s heat law, which says that the heat flow is pro- 
portional to the temperature gradient. This law is usually written in the form 

\J/(z) = — A grad T (1 6-20) 

where A is called the thermal conductivity. Comparing Eq. (16-19) with Eq. (16-20) 
gives 

A^ilpv^ (16-21) 

N 0 


A comparison of this equation with Eq. (16-15) for r\ shows that 


= yCy 
M 


( 1 6 - 22 ) 


where Af is the molecular weight of the gas. Let CJM = c„, the specific heat of the 
substance. Equation (16-22) predicts that rjcJA = 1. A more refined theory still pre- 
dicts that this ratio should be a constant, but somewhat different from unity. Table 
16-1 gives this ratio for several gases. 

So far we have studied the flow of momentum and the flow of energy. This led to 
expressions for the coefficient of viscosity and the thermal conductivity. Lastly, we shall 
study diffusion, which is the net transport of molecules through a surface. The macro- 
scopic law in this case is Fick’s law, which says that the flow of molecules of a certain 
type through a gas at uniform pressure is proportional to the concentration gradient 
of these molecules, or 

flow = — Dj grad p x (16-23) 

The proportionality constant is called the diffusion coefficient. We shall now derive a 
kinetic theory expression for D x . In order to do this we must consider a two-com- 
ponent system. 
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Let us set up an idealized experimental system consisting of a tube filled with a 
mixture of two gases of kinds 1 and 2. Let the axis of the tube be the z-direction. The 
densities of the two gases are functions of z only. In order to look at a flow that is 
due only to a concentration gradient, and not a pressure or temperature gradient, 
we must have the total density be the same for all z, i.e., 

Pi(z) + Pi( z ) = p = constant 

Each component has its own distribution of velocities, and so we can write 
Pi(v, z) = PiOO/iOO p 2 (v, z) = p 2 (z)/ 2 (v) 

Let us now calculate the flow of molecules of species 1. A particle of species 1 arriving 
at z comes, on the average, from z' = z — v z l x /v, where l x is the mean free path of 
species 1 molecules. The density of these molecules at z' is 

Pl(v, z') = p,(/)/ l(v) 

= PiOO/i(v) - — /j(v) + • • • 

v az 


The number passing through a unit plane perpendicular to the z-axis in the +z-direc- 
tion per second is then , 

v z Pi(y, z') = Pi(zK/i(v) - /i(v) + • • • 

v az 


The net flow is obtained by integrating this over all velocities, giving 
» °° 2 

iMz) = ~h ~f£ Vy, v z ) dv x dv y dv z 

~ GO 

or 


^i( z ) = - ili»i g rad Pi 0 6-24) 

Comparison of this to Fick’s law gives 

06-25) 

There is a similar expression for the other component. The mean free path l x occurring 
in this equation is the mean free path of species 1 in a mixture. In a mixture of two 
gases, the mean free paths are, to the same degree of approximation as Eq. (16-5) 
(see Problem 16-16), 


i 4 

1 4na l 2 p 1 + nfa + c 2 ) 2 p 2 

1 4 

2 n(o x + c 2 ) 2 Pi + 4nG 2 2 P 2 

Using these values for l x and l 2 , D x and D 2 become 

D _ 8 /2m 1/2 l 

1 3 \nrnj 4nG 2 p x + n(c x + c 2 ) 2 p 2 

D 8 pm 1/2 i 

2 3 \nm 2 j n(G x + g 2 ) 2 Pi + 4nc 2 2 p 2 


( 16 - 26 ) 


( 16 - 27 ) 


( 16 - 28 ) 



364 KINETIC THEORY OF GASES AND MOLECULAR COLLISIONS 


This predicts that the diffusion coefficient of gases should vary inversely with the 
pressure at constant temperature, i.e., diffusion is proportional to the mean free path. 
This is observed experimentally. 

A particularly interesting quantity for theoretical purposes is the coefficient of self- 
diffusion, which is the diffusion of some particular “marked” or “tagged” molecule 
through a gas of otherwise identical molecules. The self-diffusion coefficient is difficult 
to measure experimentally, but can be measured by studying the interdiffusion of 
isotopic species. The self-diffusion coefficient is obtained from Eq. (16-28) by letting 
a x = a 2 and dropping the subscripts to get 



12 \ 3/2 /m 1/2 1 

W \ m ) pc 2 


( 16 - 29 ) 


We can easily derive the relation 
prnD^ i 

n 


( 16 - 30 ) 


by dividing Eq. (16-29) by Eq. (16-15). Table 16-1 tests this ratio. 

We said at the beginning of this chapter that the kinetic molecular theory can be 
approached from various levels of rigor and that the only difference in the results of 
these various treatments was numerical constants of the order of unity. Let us collect 
our results here and compare them to the rigorous expressions for hard spheres. We 
shall derive these rigorous expressions in a later chapter. Our approximate expressions 


are 


^/approx 


v approx 


(mkT) 112 


-I© 


a 

312 /kT' 1/2 


NqO 2 
i/2 | 


1 12 y l2 /kT\ 112 1 
^ox “3 W W 

P m - Papprox 


Papprox 


= 1 


( 16 - 31 ) 

( 16 - 32 ) 

( 16 - 33 ) 
( 1 6 - 34 ) 


Vapprox C v _ i 
^approx 

The rigorous results are 


( 1 6 - 35 ) 


M 

5 

{mkT) 112 

'/rigorous 

“ 16* 1 ' 2 

a 2 

2 

_ 25 tkl 

n \ 1/2 C v 

^rigorous 

32 \m 

n) N 0 a 2 

■^rigorous 

- 3 j 

(kT \ 1/2 1 

8 n 112 ' 

i m ) a 2 p 

piYl ^rigorous 

6 


•^rigorous 

~5 


^/rigorous 

2 


X - 

'’'rigorous 

— 5 



( 16 - 36 ) 
( 1 6 - 37 ) 
( 16 - 38 ) 
( 16 - 39 ) 
( 16 - 40 ) 
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From all this we see that our approximate expressions differ from the exact expressions 
by only numerical factors. We see that »/, pprox /»/ rigoroos = 32^2/1571 = 0.96; A approx / 

^■rigorous = 64^2/75* = 0.38; Z) approx /Z) rigorous = 16^2/9* = 0.80. 

Remember that these are hard-sphere expressions. Even the exact results do not 
give the observed temperature dependence of the various transport coefficients. 
Presumably this is because the effective hard-sphere diameter of real molecules is a 
function of temperature. An early attempt to rectify this deficiency replaced the fixed 
hard-sphere value of o 2 by 


a 2 = cr 0 


2 



(1 6-41 ) 


where S is called the Sutherland constant of the particular molecule. This is often a 
useful way to save the rigorous mean free path expressions for the transport coef- 
ficients, but has little fundamental basis. The real problem is that the concept of a 
mean free path becomes less well defined when there is an attractive potential as in 
the Lennard-Jones potential. In later chapters we shall develop a molecular theory of 
transport in gases without appealing to a mean free path. This will be done through 


Table 16-1. Some results from the mean free path theory of transport 


gas 


experimental* 



calculated from 
the rigorous theory 

TjCjX 

pmD/r) 

Suther- 

land 

con- 

stant 

CK) 

1 } x 10 5 
(g/cm • sec) 

Ax 10 s 

(cal/cm - sec • deg) 

D 

(cm 2 /sec) 

<7 from 7i o from A 

(A) (A) 

a from D 

(A) 

Ne 

29.7 

11.1 

0.45 

2.64 

2.57 

2.41 

0.40 

1.37 

56 

At 

21.0 

3.9 

0.16 

3.67 

3.65 

3.47 

0.40 

1.32 

142 

Kr 

23.3 

2.1 

0.08 

4.20 

4.17 

4.01 

0.40 

1.30 

188 

n 2 

16.6 

5.5 

0.18 

3.75 

4.30 

3.48 

0.53 

1.39 

104 

co 2 

13.7 

3.5 

0.10 

4.64 

5.76 

4.30 

0.60 

1.40 

254 

CH* 

10.3 

7.2 

0.21 

4.15 

4.80 

3.75 

0.53 

1.43 

164 

c 2 h 6 

8.5 

4.4 

— 

5.34 

7.00 

— 

0.69 

— 

252 


* Values at 0°C and 1 atm pressure. 


the so-called Boltzmann transport equation. The result of this development will be 
the derivation of equations that look much like Eqs. (16-36) through (16-38), but 
with an effective collision diameter which is given rigorously in terms of theintermolec- 
ular potential. 

We can use Eqs. (16-36) through (16-38) to calculate a from experimental data. 
All three equations should give the same value for u. Table 16-1 gives a calculated 
from each of them. It can be seen that not only are the values reasonable, i.e., of order 
of 10“ 8 cm, but that they are fairly close to one another. Table 16-1 also contains 
some experimental values of transport coefficients, Sutherland constants, and the ratios 
in Eqs. (16-39) and (16^10). 


16-2 CLASSICAL MECHANICS AND MOLECULAR COLLISIONS 

The rigorous molecular theory of transport that we are going to study depends upon 
the mechanics of the molecular collisions that occur in the gas. In this section then, 
we shall discuss the elementary classical mechanics of collisions. We assume that the 
interaction between the molecules depends only upon the distance between them. 
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Let us consider a collision between two molecules with masses m y and m 2 . Let 
their velocities before the collision be v A and v 2 , and their velocities after the collision 
be v/ and v 2 '. We shall generally use primes to indicate quantities after the collision 
has occurred. If no chemical reaction takes place, we have the conservation of mass 
condition, namely, that m y = m y and m 2 = m 2 . Note that the total mass before the 
collision is equal to the total mass after the collision, 

m x + m 2 = m y + m 2 (1 6-42) 

We, furthermore, have conservation of momentum (as long as there are no external 
forces): 

m i y l + m 2 v 2 = m{v y + m 2 \ 2 (1 6-43) 

Assuming that the energy contained in the internal degrees of freedom is not changed 
because of the collision, the kinetic energy before the collision must equal the kinetic 
energy after 

+ \m 2 v 2 = %m y v \ 2 + \m 2 v 2 2 (1 6-44) 

This defines what is called an elastic collision. Each of these conservation conditions 
is of the form 

*Ai + ^2 = 'I'l + 0 6-45) 

Quantities that obey this equation are called summational invariants. 

The velocities v l3 v 2 , v/, and v 2 ' discussed above are absolute velocities. In terms of 
some arbitrary coordinate system, the positions of the colliding molecules are r y and 
r 2 and the velocities are the time derivatives of these position vectors. As we have seen 
in Chapter 1, it is natural to define 


and 


mpt +m 2 r 2 


m t +m 2 


(1 6-46) 


r = ri -r 2 (16-47) 

These are the center-of-mass coordinates and relative coordinates, respectively. 

If we differentiate r c twice with respect to time, we find 

m t r t + m 2 r 2 F^r) + F 2 (r) 

if c = — — — = — = 0 (16-48) 

m y + m 2 m y + m 2 

since Fj(r) = — F 2 (r) by Newton’s law. Equation (16-48) shows that the center of mass 
of a colliding pair of atoms undergoes uniform rectilinear motion. 

If we differentiate the relative coordinates in the same way, we find that 


r = ri-r 2 = 


F t (r) F 2 (r) 


AW, 


m 2 


or that 



F t (r) 


? = H¥ (r) (16-49) 

where fi = m 1 m 2 /(m 1i + m 2 ) is the reduced mass of the colliding pair. Since F(r) is a 
central force, i.e., is in the same direction as r, Eq. (16-49) is the same as the equation 
of motion of a single particle of mass ^ in a central force field. 
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Figure 16-3. (a) The trajectory of a collision in the relative coordinate system for a potential such as 
the Lennard- Jones potential. In this coordinate system, the particles approach with velocity 
v r and recede with velocity v/. The moving particle is located by the coordinates r and 6 ; r m 
is the minimum valne of r; 6 m is the angle 6 associated with r m ; b is the impact parameter; 
and x is the angle of deflection, (b) The trajectory of a collision in the center-of-mass co- 
ordinate system for a pair of particles with equal masses. The symbols here are the same as 
in (a). Note that the particles approach each other with equal and opposite velocities in this 
coordinate system. 


The vectors r and r define a plane that contains both particles and their center of 
mass. Since F(r) lies in this plane, the relative acceleration ? also lies in this plane, and 
so the entire relative motion takes place in a plane. The entire plane itself is translated 
uniformly with velocity r c . 

In the relative coordinate system, the collision looks as though one of the particles 
is fixed and the other has mass p, initial velocity v r , and final velocity v/. This is shown 
in Fig. 16-3(a), where several other quantities are defined. If there were no interaction 
between the particles at all, b would be the distance of closest approach during the 
collision. This is called the impact parameter of the collision. The r and 6 are nothing 
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but the polar coordinates locating the moving particle; r m is the minimum value of r; 
and 0 m is simply the angle associated with r m . Lastly, the angle x is called the angle of 
deflection and is equal to the angle between v r and v r # . It is also equal to n — 2 6 m . 

The collision would appear quite different to an observer located at the center of 
mass. The velocities of the two particles in the center-of-mass coordinate system are 




and the momenta are 


(1 6-50) 


Pic = 

P2 c=-P*r (16-51 ) 

We see that the two momenta are equal and opposite in this coordinate sytem. This 
collision is shown in Fig. 16-3(b) for a pair of colliding particles with equal masses. 

We can derive a number of important and useful relations. For example, using Eq. 
(16-50) it is easy to show that 

+ \m 2 v 2 2 = M™i + m 2 )v c 2 + \m x v lc 2 + \m 2 v 2c 2 

--- i + m 2 )v c 2 + ifiv, 2 (1 6-52) 

whose interpretations should be obvious. If we denote velocities after the collision 
with a prime and use the fact that the kinetic energy of the center of mass does not 
change, we see that 

ifW 2 = ifw' r 2 (16-53) 

We see from this that the relative velocity changes direction but not magnitude as the 
result of a collision. The particles approach along v r and recede along v/. These are 
labeled in Fig. 16-3(a). One can also show (Problem 16-24) that the magnitudes of 
v lc and v 2c do not change. 

The equations of motion of the two molecules are best written in the polar co- 
ordinate system shown in Fig. 16-3(a). They are most easily obtained by simply writing 
down the conservation of energy and conservation of angular momentum conditions. 
If v r is the initial relative velocity, then conservation of total energy gives 

\gv 2 = ig(r 2 + r 2 6 2 ) + u(r) (1 6-54) 

The left-hand side represents the kinetic energy before or after the collision, i.e., when 
r -* ± oo in Fig. 16-3(a), and the right-hand side is the energy during the collision, 
when u{r) is nonzero. The angular momentum before or after the collision is gbv r , 
and the angular momentum at any time during the collision is fir 2 6, and so we have 

libv r = ixr 2 d (16-55) 

Equations (16-54) and (16-55) are the equations of motion and completely describe 
the collision in terms of v r9 u(r), and b, which characterize the collision. 
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We can now eliminate 6 between these two equations to get one closed equation 
for r(t): 

h*>r 2 = itf 2 + + u(r ) ( 1 6 - 56 ) 


This equation can be thought of as describing the one-dimensional motion of a particle 
of mass n with total energy ifw 2 moving in an effective potential 


«eff = «('•) 


2 r 2 


( 16 - 57 ) 


The only quantity that will enter into our transport equations is the deflection angle 
X . The most direct way to get at this is to solve Eq. (16-56) for drjdt and then divide 
it by dO/dt from Eq. (16-55): 


dr drjdt r* r u(r) b 2 1 1/2 

d6 d6/dt b L i^v 2 r 2 \ 


( 16 - 58 ) 


Let r m and 6 m be the distance and angle at the distance of closest approach. We may 
get 6 m by integrating Eq. (16-58) [cf. Fig. 16-3(a)]: 


bdr 

2u(r) b 2 ] 1/2 

\w 2 r 2 J 

and hence x(i, v r ): 



X(b , v r ) = n-2b\ 

J r m 



dr 

Mr) 

fiv , 2 


by 2 

7 2 _ 


( 1 6 - 59 ) 


( 1 6 - 60 ) 


The lower limit here, the distance of closest approach, is found by setting dr/dO equal 
to zero, i.e., by solving 


W - «(r) 


fW, 


2 b 2 


2 r 


= 0 


( 16 - 61 ) 


for r. 

The transport coefficients of dilute gases all may be expressed in terms of weighted 
integrals of the angle of deflection y(Jb, v r ). For example, the coefficient of viscosity is 
given by (Chapter 19) 

5kT 

= — -s s-s (1 6 - 62) 

87i 1/2 J e y2 y 7 1 J sin 2 x b db \ dy 

where y is a reduced relative velocity defined by y 2 = fw 2 /2kT. We shall see in Chapter 
19 that the other transport coefficients have fairly similar forms. 

The only intermolecular potential functions for which Eq. (16-60) can be integrated 
analytically are the hard-sphere potential and an r“ 4 repulsive potential. This latter 
potential function is not too realistic for most intermolecular interactions, but it is of 
historical interest since Maxwell realized over a hundred years ago that it was possible 
to carry through a rigorous kinetic theory of gases for this potential. Hypothetical 
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molecules that have a repulsive r “ 4 potential are called Maxwellian molecules. For the 
rigid-sphere potential, the distance of the closest approach is a if b <, a and is b if 
b>a. Consequently, Eq. (16-60) can be easily integrated (Problem 16-26): 


x(b, g) — 2 arccos 



b <Lg 


= 0 


b> g 


(1 6-63) 


Note that x is independent of y for a hard-sphere potential. For more realistic poten- 
tials, Eq. (16-60) must be integrated numerically. This has been done for a number of 
potentials, and we shall return to this in Chapter 19. 

In the next and last section of this chapter, we shall calculate the average square of 
the momemtum change for a binary collision. We do this not only because we shall 
need this quantity in Section 21-9, but also because it serves to derive correct expres- 
sions for the mean free path, the total rate of collisions, and the rate of collisions with 
relative velocities in the range v r and v r + dv r and impact parameter in the range 
b + db . 


16-3 MEAN-SQUARE MOMENTUM CHANGE DURING A COLLISION 
The momentum change of some particular molecule upon a collision is 

A^ivj = m^' - m l y i 
= Miiyic ~ Vic) 

= MV-v r ) (16-64) 


where we have used Eq. (16-50). The square of the momentum change is 


(AmjVj) 2 = n\v’ r 2 - 2v r • v/ + v r 2 ) 

= 2n 2 v r \i - cos x) (1 6-65) 

since v r ' = v r and the deflection angle x is the angle between v r and v/. We wish 
to average this over all collisions, and so we need to calculate the collision rate in 
the gas. 

For generality we consider collisions between two dissimilar groups of molecules, 
one having velocity distribution fi(y^ds x and the other having the distribution 
/ 2 (v 2 ) d \ 2 . Let dn x and dn 2 be the number densities of these two groups so that 

dnj = Pjf( y j) dy j 7-1,2 

Now consider a collision between a molecule of the first group and one of the second 
and choose a coordinate system fixed on molecule 2 so that molecule 2 is at rest and 
molecule 1 has velocity between v r and v r + d \ r . The rate at which such collisions occur 
with impact parameter between b and b + db is v r dn t 2nb db . Since there are dn 2 of 
the second kind of molecule per unit volume, the number of such collisions per second 
per unit volume is 

dZ = 2nb db v r dn x dn 2 

dZ = p 1 p 2 2nb db v r f x (y x ) dy x f 2 (y 2 ) ds 2 
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If the molecules of the two groups are the same, we can use this equation as long as we 
divide it by 2 in order to avoid overcounting since the collision with y x = v a , v 2 = Vp 
is the same as that with \ x = , v 2 = v a . Thus we write 

dZ = Kp x p 2 2i ib db v r f x (v x ) dY 1 f 2 (v 2 ) dy 2 (1 6-66) 

where k = \ for collisions between like molecules and k = 1 for collisions between 
unlike molecules. 

If we now substitute the Maxwell-Boltzmann distribution for f x (v x ) and / 2 (u 2 ), Eq. 
(16-66) becomes 

dZ = Kp x p 2 ^™2nkT) 3 Vr ^ n ^ ^ e~ imiVl2+m2V22)/2kT ds x dy 2 (1 6-67) 

Transform from y x and v 2 to center of mass and relative coordinates. The Jacobian 
of this transformation is unity since 

and 

j{v ix v 2x \ \dv lx /dv rx dv lx /dv cx = m 2 IM 1 

tax v rx ) I dv 2 Jdv rx dv 2x jdv cx -mJM 1 

with a similar result for the y- and z-directions. Equation (16-52) says that 

\m x v x + \m 2 v 2 = + m 2 )v c 2 + ipv 2 

and so Eq. (16-67) becomes 

dZ = Kp x p 2 3 27 zb db Vr e~ {ntl+m2)Vc2 i 2 kT dy c e~ tlv2,2kT ds r 

(27 zkT) 

We integrate this over the center-of-mass coordinates to get the desired result for the 
number of collisions per unit time per unit volume with relative velocity in the interval 
v r and v r + dv r and impact parameter in the interval b and b + db 

( ll \ 3/2 /2\ 1/2 

— j l-j 2nb db v 3 e~ flVr2 l 2k T dv r (1 6-68) 

Notice that this distribution has an extra factor of v r which essentially reflects the fact 
that molecules with a higher relative velocity collide more frequently. Problem 16-32 
discusses a paper by Levine and Birnbaum in which they present an elegant application 
of this distribution. 

To find the total rate of collisions, we integrate this over b and v r . In the case of 
hard spheres, the integral over 2nb db gives just tig 2 , and we find that 

Z = Kp x p 2 na 2 {v x 2 + z> 2 2 ) 1/2 (1 6-69) 

It is shown in Problem 16-25 that v r = (; v x 2 + v 2 2 ) 112 = (&kTfrm) i/2 , and so this for- 
mula for Z is 

Z = Kp x p 2 no 2 v r 


(16-70) 
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We can derive a number of interesting quantities from this one formula. If k = 1 
we get the rate of unlike-molecule collisions per unit volume 

Zj 2 = P 1 P 2 no 2 v r (1 6—71 ) 


If k = we get the rate of collisions per unit volume in a pure gas 


z n =— pw 

V* 


(1 6-72) 


where we have used the fact that v r = 2 1/2 v where recall that v = (SkTjnm) 112 . Lastly, 
if we focus upon one particular tagged molecule, then the number of collisions that it 
undergoes per unit time per unit volume is 


Zi = P 2 no 2 v r 


(1 6-73) 


We shall now calculate the mean-square change in momentum upon a collision. 
To be more precise, we wish to calculate the quantity 

1= J <(Ap 1 ) 2 >27t6 db (1 6-74) 

where the angular brackets denote an average over the relative velocity of the colliding 
particles. If we divide Eq. (16-68) by Eq. (16-70) (without the 2nb db or nc 2 \ we get 
the fraction of collisions with relative velocity between v r and v r + dv r 

p(i> r ) dv r = 1 v r i e~ llv ' ll2kT dv r (1 6-75) 

We now multiply Eq. (16-65) by Eq. (16-75) and integrate over v r to get 

<(Api) 2 > = », 5 (1 - cos x ) e -^ 2kT dv r (1 6-76) 

If we define a collision cross section by 

r 00 

<r(v r ) = 271 (1 - cos x)b db (1 6-77) 

J o 

then Eq. (16-74) becomes 

1 = p2 (/^) J 0 v MVr)e~ llVrll2kT dv r 

= 8 pkT \ y 5 <r(y)e~ y2 dy (16-78) 

where y 2 = \w 2 j2kT. We shall use this equation directly in Section 21-9. 

In the next chapter we shall introduce a subject that is probably unfamiliar to most 
readers, but one that plays a very important role in any discussion of transport pro- 
cesses and liquids in motion in general. We shall derive some of the principal equations 
of hydrodynamics and then discuss their simpler solutions and applications. 
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PROBLEMS 

16-1. Estimate the mean free path in a gas at 300°K and 1 atm: 0.1 atm; 0.01 atm. 
Assume a size parameter o = 3 A. 

16-2. Estimate the average time between collisions in a gas under the conditions of the 
previous problem. 

16-3. Estimate the number of collisions per second experienced by one molecule in a gas 
under the conditions of the previous two problems. 

16-4. Estimate the total number of collisions per second in a gas under the conditions of 
the previous three problems. 

16-5. At 25 °C, H 2 has a viscosity of 88 micropoise. Estimate the diameter of the hydrogen 
molecule. 

16-6. At 0 C and 1 atm, helium has a viscosity of 186 micropoise. Estimate the thermal 
conductivity of helium under these conditions. The experimental value is 33.8 x 10“ 5 
cal/sec • cm • deg. 

16-7. Give order-of-magnitude values for the following quantities in a gas at normal 
conditions 

(a) speed of sound 

(b) mean free path 

(c) mean time between collisions 

(d) duration of a collision 

(e) diffusion coefficient (include units) 

16-8. The van der Waals constant, b , of argon is 29.5 cm 3 /mole. Estimate an atomic 
diameter from this and hence calculate the viscosity of Ar at 25°C. 

16-9. Show that the escape velocity from a planet’s surface is given by 
v = (2gR) lf2 


where g is the gravitational constant and R the radius of the planet. Given that the gravita- 
tional constant of the earth is 980 cm/sec 2 and the radius is 6.4 x 10 8 cm, calculate the 
fraction of hydrogen, helium, nitrogen, and oxygen molecules having velocities exceeding 
the escape velocity. 

16-10. Repeat the previous problem for the moon, for which g = 167 cm/sec 2 and 
7? = 1.8 x f0 8 cm. 

16-11. Derive an expression for the mean free path in a two-dimensional gas. 

16-12. Derive the analog of Eq. (16-11) for a two-dimensional gas. 

16-13. Derive the analog of Eq. (16-31) for a two-dimensional gas. 

16-14. Derive the analog of Eq. (16-32) for a two-dimensional gas. 

16-15. Derive the analog of Eq. (16-33) for a two-dimensional gas. 

16-16. Show that the mean free paths in a binary mixture are given by 


/i = 


47TCT i 2 p t -f-7r(CT! +<J 2 ) 2 P2 


and 


4 
h 


7t(ct! + CT 2 ) 2 Pi + 47 T<J 2 7 P2 
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16-17. Prove that the distribution of free path lengths is given by 
p{L) dL^e~ Lfl dL 

Calculate the fraction of molecules with free paths greater than 0.5/; 1.0/; 2.0/; 3.0/. Note 
that this distribution is quite broad. 

16-18. What is the probability that a nitrogen molecule at STP travels (a) 10“ 3 cm, 
(b) 1 cm without experiencing a collision. 

16-19. In Section 16-1 we derived the formula A = £r)c v , where c=l in the simple 
theory and e = f in the more rigorous theory. The underlying physical explanation of this 
discrepancy was first pointed out by Eucken in 1913 (cf. Loeb in “Additional Reading,” 
pp. 247-250). He pointed out that for translational motion only, molecules with larger 
velocities have larger mean free paths, so that they travel farther and carry more kinetic 
energy than the more slowly moving molecules. Hence the transport of kinetic energy by 
these molecules is more effective, and this fact leads to a value of e > 1. Eucken argued that 
this would not be so for the internal energy carried by a polyatomic molecule, and that in 
this case £ would be unity. Thus Eucken was able to present a plausible, though certainly 
not rigorous, extension of the formula A = et)C v for polyatomic molecules. He argued that 
the thermal transport coefficient is made up of two contributions: 

A = (£trans CV.trans “1“ ^lnl CV.int)^ 

where £*«„* — f, £mt = 1, CV.tr.n, = 3R/2, and CV.mt = C y — CV.tran*. Show that this leads 
to the formula 

A = i(9y — 5)r)C y 

where y — C p iC v . This formula is known as Eucken’s correction. Show that this reduces 
to the rigorous result for a monatomic gas. Table 16-2 shows the agreement of Eucken’s 
formula with experimental data. 


Table 16-2. A test of the Encken correction formula 



V 

(10“ 7 cgs) 

A 

10” 3 cal 

Cy 

cal 

y 

A 

TjCy 

K9y-5) 

cm • sec • deg 

g • deg 

Helium 

1875 

0.344 

0.753 

1.660 

2.44 

2.485 

Neon 

2986 

0.1104 

0.150 

1.64 

2.47 

2.44 

Argon 

2100 

0.0387 

0.0763 

1.67 

2.42 

2.51 

h 2 

840 

0.416 

2.40 

1.410 

2.06 

1.92 

n 2 

1664 

0.0566 

0.178 

1.406 

1.91 

1.91 

o 2 

1918 

0.0573 

0.156 

1.395 

1.92 

1.89 

H 2 0 at 100C 

1215 

0.0551 

0.366 

1.32 

1.24 

1.72 

co 2 

1377 

0.0340 

0.151 

1.31 

1.64 

1.70 

nh 3 

915 

0.0514 

0.401 

1.32 

1.40 

1.72 

CH 4 methane 

1027 

0.0718 

0.400 

1.31 

1.75 

1.70 

C 2 H 4 , ethene 

948 

0.0404 

0.282 

1.25 

1.51 

1.56 

C 2 H 6 , ethane 

854 

0.0428 

0.325 

1.23 

1.54 

1.52 


16-20. Estimate the fractions of the coefficient of thermal conductivity due to the transla- 
tional and internal degrees of freedom for Ar; Cl 2 ; CCU; C 2 H 6 . 

16-21. Prove that 

niiivic' — vie) = m(v/ — v r ) 

16-22. Proye that the square of the momentum change during a collision is 
(A/wjVi) 2 — 2/x 2 iv 2 (l — cos x) 
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16-23. Prove that x~ 77 ^ 20 m . 

16-24. Show that the magnitudes of y lc and v 2c do not change during a collision. 
16-25. Show that the average relative velocity is given by 

Vr = (v i 2 +v 2 2 ) 1/2 



where /x is the reduced mass of the colliding pair. 

16-26. Show that the scattering angle x(b, s) for hard spheres can be determined analyt- 
ically to get 

x(b y g) = 2 arccos b<a 


= 0 b>a 

16-27. Derive an expression for the fraction of collisions in which the relative kinetic 
energy of the colliding pair of molecules is between E and E + dE. The integral of this from 
E* to oo is an important quantity in many theories of gas-phase chemical kinetics. Why? 

16-28. Derive an expression for the distribution of relative velocities in two dimensions. 
Use this to calculate the average relative velocity in a two-dimensional gas. 

16-29. Derive an expression for xfJ>, g) in terms of u(r) for a two-dimensional system. 
16-30. Derive the analog of Eq. (16-78) for a two-dimensional gas. 

16-31. Equation (16-73) says that the number of collisions of some one particular tagged 
molecule per unit time per unit volume is 


Zi = pna 2 v r 


Use this to show that the mean free path is given by 

_ 1 
^ 2 in 7rpa 2 

16-32. Levine and Birnbaum* have published an elegant treatment of the classical theory 
of collision-induced absorption in rare-gas mixtures which is based upon the kinetic theory 
of gases. First let us review what is meant by collision-induced absorption. We have all 
learned in physical chemistry that gases such as Ar, N 2 , C0 2 , etc., do not absorb in the 
microwave or far-infrared regions since the molecules do not possess permanent dipole 
moments, the necessary condition for absorption in that region. Consider, however, a 
mixture of two rare gases such as helium and argon. There will be many helium-argon 
collisions in this mixture, and during such a collision we can consider the helium-argon pair 
to be a transient heteronuclear diatomic molecule, albeit undergoing just one vibration. 
Regardless of the transient nature of this heteronuclear diatomic molecule, it does possess a 
(transient) dipole moment and hence leads to absorption in the microwave and far-infrared 
regions. Since this absorption occurs only as the result of a collision, it is called collision- 
induced absorption. Of course, the observed intensity is much less than in conventional 
microwave absorption, but nevertheless it was first observed in 1959 by Kish and Welsh? and 
later in more detail by Bosomworth and Gush.J 

There have been a number of theoretical analyses of such collision-induced absorption, 
but the one by Levine and Birnbaum referred to above is particularly simple and instructive. 

* Phys. Rev ., 154, p. 86, 1967. 
t Phys. Rev. Lett ., 2, p. 166, 1959. 

X Can . J. Phys ., 43, p. 729, 1965. 
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Their starting point is based upon some electromagnetic theory with which the reader may 
not be familiar, but if he is willing to accept this, the rest of the problem is fairly straight- 
forward. Levine and Birnbaum start with KirchhofFs law, which relates the absorption 
coefficient a(o>) to the power due to spontaneous emission per unit frequency interval per 
unit volume of sample /( o>) by 


a(o>) = 


/(*>) 


cp( T ) 


where c is the speed of light and p(o>, T) is the energy density in the blackbody radiation field. 
This is given by Planck’s law, Eq. (10-99), 


p(o>, T) dw 


ha > 3 daj 


7r 2 c 3 [cxp(haj/kT) - 1] 


The quantity /( o>) can be written as a statistical average over the power due to collisions of 
impact parameter b and relative speed v r according to 


/(*>) — JJ Z(b, v r )I(b, v r , oj) db dv r 


where Z(b, V r ) is the number of collisions with impact parameter b and relative speed v r 
[Eq. (16-68)] and t(b y v r , o>) is the power emitted by collisions of this type. 

The emission intensity /(6, tv,a>) is given by electromagnetic theory by the so-called 
Larmor power formula: 


2^4 co 

f(b,v r ,<o) = -^^ J dte ic *yib,v,,t) 


where [i(b, v r , t) is the electric dipole moment of the colliding pair as a function of time for 
a given b and v r . This formula is essentially the power emitted by an oscillating dipole 
moment. 

Levine and Birnbaum write the dipole moment as 


V- = n-to 


H(r)r 


to denote that p. depends only upon the separation r of the two colliding atoms, and they 
write 

r = r(b, v r , t) 


to specify the collision trajectory. Clearly r depends on the intermolecular potential u(r\ but 
Levine and Birnbaum neglect the potential and assume straight-line paths, with 

x — v r t y — b z — 0 

Although this is certainly not correct, particularly for small v values, nevertheless it has two 
virtues. The first is that it leads to simple mathematics; in particular, we do not have to 
compute trajectories, and the subsequent derivation of the line shape is easy. Second, the 
comparison of the results of such a calculation with experiment helps us to assess the impor- 
tance of the potential in fixing the lineshape, i.e., helps us decide whether or not lineshape 
measurements can be expected to yield useful information about the potential function. 

Although the dependence of the dipole moment function on the separation r is not yet 
known, it is probably approximately Gaussian looking, and Levine and Birnbaum assume 
that 


p,{r) = p, 0 yre- y2/2 
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where y” 1 is the range of the dipole moment and characterizes its strength. The factor yr 
has the following significance: If one considers a realistic potential, then its repulsive part 
prevents two atoms from approaching too close to each other; i.e., the distance of closest 
approach is of the order of the range of the repulsive component of the potential. In a manner 
of speaking, the potential function “ shuts off” the dipole moment for very small r values. 
Similarly, the factor yr “ shuts off” the dipole moment when r < y~\ Thus, roughly speaking, 
the absence of a repulsive potential has been corrected for in the form used for n(r). 

Now the calculation of the complete lineshape a(o>) for the collision-induced absorption 
in rare-gas mixtures follows in a fairly straightforward manner. Show that the Fourier 
transform of jx has two nonvanishing components: 

ixoam 112 ( a> ) 

Mb, »„«») = exp(-y^ - 

fio bn 1 ' 1 I oj 2 | 

Mb, v„a>) = exp j-y V - — j 

where 

Kb, v,,i*)= + £/) 

Now multiply f(b, v r , a. >) by Z(b 9 v r ) [Eq. (16-68)], integrate over b, and show that the re- 
maining integrals are of the form 

f°° ( fJLVr 2 w 2 ] 

with n = — 1 for the ^-component and n = 1 for the y-component. Using the fact that 



Figure 16-4. The absorption coefficient for a helium-argon mixture at 295°K. The points are the experi- 
mental data of Bosomworth and Gush {Can. J. Phys. 43, 729, 1965), and the solid line is 
calculated by Levine and Birnbaum for y — 1.357 A -1 and fi 0 = 0.166 debyes. (From H. B. 
Levine and G. Birnbaum, Phys. Rev., 154, p. 86, 1967.) 
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where K n (z ) is a modified Bessel function of the second kind and that 


dK n (z) 

dz 


i[K n ^(z)-K n+1 {z)] 


and 


zK n ^(z) + zK„+i(z) = 2nK„(z) 


show that the line shape is given by 


a(o>) 




12cy s 


(-U 

\7 r\ikT / 


1/2 


x*K 2 (x) 


where x is a reduced frequency 

aj / fM \ 1/2 2ttc<j ( \ 1/2 

x = yw ) =_ j r \w ; ) 

where a is the frequency in wave numbers. 

The reduced lineshape function A(o) = a(o)(pi p 2 )~\N 0 !Vo) 2 (N 0 - Avogadro’s number, 
V 0 = 22413.6 cm 3 ), which is the absorption coefficient measured by Bosomworth and Gush, 
is compared to their experimental data in Fig. 16-4. It can be seen that p 0 and y can be 
chosen so as to give excellent agreement; Levine and Bimbaum discuss the significance of 
this agreement in some detail. 



CHAPTER 17 


CONTINUUM MECHANICS 


In this chapter we shall give an introduction to a field that is generally known as 
continuum mechanics. Continuum mechanics includes the areas of heat flow, fluid 
flow or hydrodynamics, and the mechanics of deformable bodies as special cases. 
The central idea in continuum mechanics is to define a local density, velocity, and 
energy that are continuous functions of space and time. Because of the molecular 
nature of matter, such concepts can be quite awkward to define rigorously, but what 
we mean by a local density, for example, can be defined by considering the mass 
contained in a small volume, Av say, where this small volume is large enough to 
contain many molecules (so that the density is a continuous function of r) but small 
enough that the mass divided by Av describes a local density at the point r. In short, 
we treat matter as continuous. We then derive a set of partial differential equations 
describing the spatial and time variations of these local density functions. The solu- 
tion of these equations for given conditions and geometries becomes a boundary- 
value problem of mathematics. The partial differential equations we derive in Section 
17-1 are simply a mass balance equation, a momentum balance equation, and an 
energy balance equation. Thus we see that in Section 17-1 we simply formulate the 
fundamental conservation laws in a continuum mechanics formalism or notation. 

Since continuum mechanics in general and hydrodynamics in particular are some- 
what neglected topics in most undergraduate curricula nowadays, the material of this 
section will probably be unfamiliar to most readers. Therefore, Sections 17-2 and 
17-3 are devoted to solving the basic continuity equations for a number of special 
cases. For example, in Section 17-2 we shall derive and solve the diffusion equation, 
the heat flow equation, and Euler’s equation; in Section 17-3 we shall derive and 
discuss the Navier-Stokes equation. These last two equations are fundamental equa- 
tions in hydrodynamics, and from these we shall derive some familiar results such 
as the Poiseuille formula for the viscosity of fluid flow through a cylindrical capillary. 
This is a standard physical chemistry experiment for the determination of the viscosity 
of a liquid. 
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17-1 DERIVATION OF THE CONTINUITY EQUATIONS 

For simplicity, we shall consider only a one-component system. The extension of 
the results of this section to multicomponent systems is straightforward. We shall 
consider the three basic continuity laws and derive a balance equation for each one. 


MASS BALANCE 

Let p m (r, t) be the mass density of the system at the point r at time t. Let u(r, t) be 
the velocity of the mass at r and t. Furthermore, let v be an arbitrary fixed volume 
located within our fluid and S be the surface of this volume. The total mass within 
v is 

M= f p m (r,t)dr (17-1) 

J v 

The rate of change of mass within v is 



(17-2) 


since v is fixed in space. Now since mass is neither created nor destroyed, the rate 
of change of mass within v must be given by the rate at which mass flows through the 
surface S. 

We define a surface element vector dS such that the magnitude of dS is equal to 
the area dS and the direction of dS is directed normally outward from the volume 
that the entire surface encloses. If n is a unit vector directed outward from the enclosed 
volume, then dS = n dS. Figure 17-1 shows this surface area element and the 
vector n. 

The rate of flow of mass is p m u, and so the component of flow normal to a surface 
element dS is p m u • dS. Integrating this over the entire surface gives 


dM 

~dt 



(17-3) 


as the total rate of flow of mass through the surface. The negative sign simply in- 
dicates that an outflow of mass yields a negative value for M = dM/dt. Recall that 
u • dS is positive if u is directed outward and negative if u is directed inward. The 
surface integral in Eq. (17-3) can be transformed to a volume integral by means of 
Gauss’ divergence theorem: 

V(p m vL)dr (17-4) 




Figure 17-1 . An elementary surface area dS, the nnit vector n, and the mass velocity vector u. 
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If we subtract Eq. (17-2) from Eq. (17-4) for M, we get 

but since the volume v is arbitrary, we must have 

dp, 


dt 


+ V-(p m u) = 0 


This is the continuity of mass equation. 

By using the vector equation (Problem 17-1) 

V • (av) = flV • u + u • Vfl 

we can rewrite Eq. (17-5) in the form 

Dp 

■g^ + A.Vu-0 
where 

D d 

— = — + u- V 
Dt dt 


(17-5) 


(17-6) 


(17-7) 


(17-8) 


This derivative occurs frequently in continuum mechanics and is called the hydro- 
dynamic derivative, the substantial derivative, or the Stokes’ operator. Physically, 
Dh{ r, t)/Dt represents the change of h{ r, t) along the flow of the system, i.e., along a 
streamline (Problem 17-2). 


MOMENTUM BALANCE 

We can now apply the same method to derive a momentum balance equation. The 
momentum density at any point is given by p m u, and the momentum within v is 


G= f Pm^dr 

J v 

Its rate of change is 



(17-9) 


(17-10) 


There are two contributions to the rate of change of momentum One is the total 
force acting on the mass within v , and the other is the rate at which momentum flows 
in and out of v. There are two force terms to consider: (1) external forces such as 
gravitational or centrifugal forces, and (2) forces exerted by the fluid surrounding v , 
i.e., stress or pressure forces. If K is the external force per unit mass, then the first 
contribution F x is given by 


Fi = fp«K* (17-11) 

J V 

In order to discuss the contribution due to the stress or pressure force, we must 
introduce the concept of a stress or pressure tensor. 

Pressure is a concept that is familiar to all chemists as force per unit area. One is 
accustomed to a pressure reading of 10 6 dynes/cm 2 , for example. A single number like 
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y 


X 

Figure 17-2. Significance of the components of the pressure tensor. 

this implies that pressure is a scalar quantity, but this is not so in general. In fact, 
pressure is neither a scalar nor a vector quantity, but is a tensor. To understand what 
is meant by this, consider a small planar element of unit area located within a fluid. 
Let the normal of this surface area be the direction along the y-axis as in Fig. 17-2. 
We now investigate the forces acting upon this area. The force on this area has three 
components. Let the component of the force acting in the jc-direction be denoted by 
p xy , that in the y-direction be p yy , and that in the z-direction by p zy . In other words, 
p^ is the force in the ath direction acting on a surface of unit area that is perpendicular 
to the y-axis. Now we can do the same thing for areas perpendicular to the x- and z- 
axis. Each of these will have three components of force acting upon it, and hence we 
will have six more components of pressure, namely, p ax and p az , a = x, y, z. Since 
any arbitrary area can be resolved into three mutually perpendicular components, it 
takes nine pressure components to completely describe the pressure acting upon this 
area. These nine quantities are the components of the pressure tensor p, which can 
be displayed in the form 

( Pxx Pxy Pxz\ 

Pyx Pyy Pyz I (17—12) 

Pzx Pzy Pzzl 

The only way a fluid can exert a component of pressure in the plane of a test area 
is by a viscous drag. Such a force is called a shear force and will be nonzero only for a 
viscous medium. In the special case of a nonviscous medium, i.e., one in which we 
can ignore viscosity effects, the off-diagonal terms of p are zero, and we have 

I Pxx 0 0 \ 

P = 0 Pyy 0 

\ 0 0 pj 

At equilibrium p xx = p yy = p zz = p> and we have 
Ip 0 0\ /I 0 0\ 

p = 10 p 01 =p|0 1 01 =p\ (17-13) 

\0 0 p) \0 0 1/ 

where I is the unit tensor, whose components are the Kroenecker delta, S iJm (The 
Kroenecker delta is a tensor.) The scalar quantity p is what is usually referred to as the 
pressure. We should see from this that pressure is a somewhat more complicated notion 
than many chemists are used to thinking. 
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The /th component of the force acting upon an arbitrary area S is then 

F i = 'LPijSj (17-14) 

j 

which we write in tensor notation as 
F = p • S 

Equation (17-14) defines this tensor operation. The total pressure force acting upon 
the mass enclosed within the volume v is 


F 2 = 



(17-15) 


where the negative sign signifies that a pressure acting upon the mass in v gives a 
positive force. 

Lastly, we need to calculate the rate of change of momentum due to a flow of 
momentum in and out of v . Momentum is given by p m u, and so a flow of momentum 
is given by p m uu. The quantity uu has nine components, u x u } , which represent the ith 
component of momentum being carried in the yth direction. The quantity uu is the 
direct product of two vectors and is actually a special kind of tensor called a dyadic. 
The rate at which momentum flows across S is 


flow = - p m uu • dS (17-16) 

J s 

The dot product of the tensor uu and the vector dS appearing here can be readily 
handled by writing it as u(u • dS ). Sometimes it is most convenient to manipulate 
uu as a formal tensor quantity, however. 

If we add Eqs. (17-10), (17-1 1), (17-15), and (17-16) and use the divergence theorem 
to transform all the surface integrals into volume integrals and then realize that v is 
arbitrary, we get the momentum balance equation (Problem 17-4) 

^^ + V-(p m uu) = p m K-V-p (17-17) 

This equation involves taking the divergence of the tensor uu. Generally, the y'th 
component of V • A is 

(V-A A u (17-18) 

Equation (17-17) is the momentum balance equation. By using various vector and 
tensor relations, it can be written in several equivalent forms. For example, if we use 
the relations (Problem 17-6) 

V • (p m rni) = p m (u • V)u + u(V • p m u) (17-19) 

and 

d(p m ii) du dp m 

and the mass continuity equation, we get (Problem 17-5) 


(17-20) 
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The form of Eq. (17-21) clearly shows that this is just the continuum mechanics 
expression for Newton’s second law. It is is the equation of motion in a continuum 
notation. 


ENERGY BALANCE 

Lastly, we derive the energy balance equation. The total energy contained within 
v is 

£ = [ p n [E + i« 2 ] dr (1 7-22) 

J V 

where p m E is the total energy, including potential energy due to external forces but 
not including the kinetic energy due to the mass flow \p m u 2 . The rate of change of 
E is 

f r? 3 

-Jt = J Jt + i" 2 )} * 0 7 -23) 

Now E can change with time for three reasons: (1) Work is done on the mass in v\ 
(2) energy flows in or out by convection currents; and (3) energy flows in or out by 
conduction. These three contributions are given by 


dE 

dt 


= — f u * p • dS - f p m [E + iu 2 ]u • dS — f q • dS 

Jc J c Jc 


(17-24) 


The vector q in the third term is the heat flux vector. The only term that should not be 
clear is the first. To see that this is, in fact, the work done on the mass in v , recall that 
p • dS is a force, and that the scalar product of force with velocity is rate of work, 
i.e., u • (p • dS) is the rate of work through the area dS. 

Equations (17-23) and (17-24) give 

J t [pm(E + i« 2 )] = - v ■ (u ■ p) - V • [p m u(E + \u 2 )\ - V • q (17-25) 


We can express this in a more convenient form by using the identity (Problem 17-6) 
V • (u ■ p) = u - (V • p) + p: Vu (17-26) 

where the double dot notation denotes the standard tensorial manipulation 

A:B = EZMi 

* j 

which in the above case becomes 


p:Vu = 2> v 


UJ 


du t 

dxj 


(17-27) 


Using this and Eq. (17-6), Eq. (17-25) becomes 


[p m (E + i« 2 )] = -u • (V • p) - p: Vu - p m (E + i« 2 )V • u - V • q (1 7-28) 


The left-hand side can be written as 
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Using the mass continuity equation in the form of Eq. (17-7), we see that the second 
term in this equation cancels the third term on the right-hand side of Eq. (17-28), 
giving 

P m -^[£ + i« 2 ]= — u • (V-p) — p: Vu — V • q (17-29) 


We can use the momentum balance equation, Eq. (17-21), to write 


Du 

u ' (V * p) = p m ii * K — p m u • — 


= p m u-K- \p m 


Du 2 
~Dt 


and so Eq. (17-29) can be written in the form (Problem 17-32) 

DE 

Pm D7“ Pn,U#K= “ p:Vu “ V ' q (17-30) 

Since E is the total energy including potential energy due to external sources, but not 
including the kinetic energy due to mass flow, and p m u • K is the rate of change in 
potential energy due to external forces, Eq. (17-30) can be written in the form 
(Problem 17-32) 

DE 

9m 15i = “P :Vu_ V ’^ (17-31) 


where E is the total energy per unit mass within v exclusive of the potential energy 
due to external fields and the kinetic energy due to the mass flow. 

Equations (17-7), (17-21), and (17-31) are the fundamental equations of continuum 
mechanics. They can hardly be solved as they stand since they contain the unknown 
quantities p and q, which in some sense are properties of the medium itself. A common 
assumption about q is to assume that q is given by Fourier’s law, i.e., that 

q = —A grad T (17-32) 

where A is the thermal conductivity of the system. The empirical law for the pressure 
tensor is probably less familiar. The most general linear expression for p is to write 


n — —V r/ kt ^ 

Pij L a ij ^ 
k,l VX k 


(17-33) 


Note that this is a generalization of Eq. (16-14). The {a lV w } is a set of 81 coefficients, 
but fortunately not all of these are independent. If the body is not undergoing any 
translation or rotation, i.e., only being distorted, and if the medium is isotropic, 
e.g., a fluid, it can be shown (cf. Hirschfelder, Curtiss, and Bird in “Additional 
Reading”) that p must be of the form 


p as [p + ($rj — k)V • u]l — 2 rj sym(Vu) (1 7-34) 

In Eq. (17-34), rj is the coefficient of shear viscosity; k is the coefficient of bulk viscosity, 
and sym(Vu) is a tensor with components 


(sym Vu ) y = 



(17-35) 
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The coefficient of shear viscosity is a measure of the resistance to a shearing force, 
such as in capillary flow. This is what is usually measured and what is referred to as 
44 viscosity.” The coefficient of bulk viscosity is a measure of the resistance to a com- 
pressive force and is fairly difficult to measure experimentally. Equation (17-34) is 
called the Newtonian pressure tensor. Note that if there are no viscous forces, i.e., 
if rj = k = 0, then p reduces to pi. Note also that at equilibrium, u = 0, and p again 
reduces to pi. 

17-2 SOME APPLICATIONS OF THE FUNDAMENTAL 
EQUATIONS OF CONTINUUM MECHANICS 

In this section we shall reduce the general, exact equations of continuum mechanics 
to some simple, special cases. For example, suppose that we assume that the rate of 
flow of mass p m u is proportional to the gradient of the density. This is known as 
Fick’s law of diffusion. Mathematically, this says that 

p m u= — D grad p m 

where D is a constant called the diffusion constant. This equation is known to be 
empirically valid when the gradient of the density is small. Substituting this into the 
continuity equation [Eq. (17-5)] gives 

jjf = D div grad p m = DV 2 p m (1 7-36) 

This is the diffusion equation. 

The diffusion equation is usually applied to the diffusion of one species through 
some medium, in which case it is written in the form 

dc ~ 

— — D V 2 c (17-37) 

where c is the concentration of the diffusing species. The solution to this equation 
depends upon the geometry of the system and the initial concentration distribution. 
A fundamental solution to this equation, however, is for diffusion in an infinite 
medium (no boundaries) with the initial condition that the diffusing substances is 
initially located at the origin. This solution is called the Green’s function of the dif- 
fusion equation, and one can use this to construct solutions for other geometries and 
initial conditions. 

Let us consider the case of diffusion in one dimension. In this case, Eq. (17-37) 
becomes 

dc d 2 c 

— = —co<x<co (17-38) 

dt dx 2 

with the initial condition 

c(x 9 0) = c 0 5(x) (17-39) 

This equation is most easily solved by defining the Fourier transform of c(x, t) by 

C(k, t) = (2tt)“ 1/2 f e lkx c(x, t) dx (17-40) 

^ -co 

with 

c(x, t ) = (2rc)~ 1/2 f e~ lkx C(k 9 1) dk 

^ — an 


(17-41) 
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It is instructive to solve this same problem in three-dimensional spherical coordinates 
assuming that the diffusion is isotropic. In this case, the diffusion equation is 


dc 2 
— = DV 2 c 
dt 

with the initial condition 
c(r, 0) = c 0 <5(r) 

We define a three-dimensional Fourier transform of c(r, t) according to 


DU 

C(k, 0 = (2tc) 3/2 JJJ e lkr c(r, 0 dr 


whose inverse is 


c(r, 0 = (2tc) -3/2 JJJ e~ ik 'C(k, t ) dk 


From Eq. (17-49) we see that 


grad c( r, /) = - JJJ e 4 ' 'A*. 0 dk 


and that 


div grad c( r, f) = V 2 c(r, t) 


Ak, 0 ^ 


= - k 2 c(r, t) 

Thus if we take the Fourier transform of Eq. (17-46), we get 

m) 


(17-46) 


(17-47) 


(17-48) 


(17-49) 


(17-50) 


with the initial condition that 

C(h, 0) = (2tc) " 3/2 c 0 = C 0 (17-51) 

The solution to Eq. (17-50) is 

k, 0 = C 0 e~ Dk2t (1 7-52) 

Although this equation is similar to Eq. (17-44), their inverses are quite different 
since Eq. (17-44) describes a one-dimensional Fourier transform and Eq. (17-52) 
describes a three-dimensional Fourier transform. The inverse of Eq. (17-52) is given 
by (cf. Appendix B) 

00 

c(r, 0 = (2rc)- 3/2 JJJ e~ ik 'C{k, t ) dk 

— 00 

CO 

= c 0 (27t) _ 3 JJJ e~ ik • r e~ k2Dt dk 

— oo 


c 0 -r 2 /4Df 

8(tc£>0 3/2 


(17-53) 
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One can show from this that 


r oo 

c(r, t)4nr 2 dr = c 0 (1 7-54) 

J o 

The reader is referred to the problems and to the “Additional Readings ” for further 
discussion of the diffusion equation. 

As another simple example, if we set u = 0 in Eq. (17-31), use Fourier’s law, and 
assume that E = E(T) y we get 


dT 

v ~di 


p m c.,~ = XV 2 T 


(17-55) 


where c v is the specific heat. This equation is the heat conduction equation and has 
essentially the same form and the same solutions as the diffusion equation. 

The mass balance and energy balance equations, then, are just generalizations of 
the diffusion equation and the heat flow equation, probably familiar to most physical 
chemistry students. The momentum balance equation, on the other hand, reduces to 
equations that are probably not familiar. These equations, however, are fundamental 
equations of hydrodynamics, and so it is necessary that we become familiar with these 
equations and some of their simpler applications. 

Perhaps the simplest hydrodynamic equation is that describing nonviscous flow. 
This equation is called Euler’s equation and is obtained immediately from Eqs. (17-21) 
and (17-34) by setting 17 = k = 0 in the pressure tensor. Euler’s equation is 

Id \ 1 

— + u ‘ V)u + — Vp = K (17-56) 

\d* / Pm 

and is a fundamental hydrodynamic equation describing nonviscous flow. 

Let us now consider a monatomic ideal gas with no heat flow and no viscosity. 
We can cancel the factor of m that occurs in both terms of Eq. (17-7) to give 


Dp 

— + pVu-0 


(17-57) 


This equation, with the number density p instead of the mass density p m , is the equa- 
tion for conservation of number of particles. If we use the fact that I : Vu = V • u 
(Problem 17-6), then Eq. (17-31) becomes 

DT p 

— + -^— V*u = 0 (17-58) 

Dt c v p m 

where pjc v p m in Eq. (17-58) is equal to 2T/3 for an ideal monatomic gas. The set of 
Eqs. (17-56) to (17-58) is called the ideal hydrodynamic equations. We can subtract 
Eqs. (17-57) and (17-58) to get 


Dp 3 p DT 
~Dt ~2T~Dt 


or 


— ( P r- 3 ' 2 ) = o 


(17-59) 


or 


pT 3/2 = constant = C x (along a streamline) 


(17-60) 
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Using the equation of state of an ideal gas, p = pkT \ Eq. (17-60) becomes 

p V 513 s constant = C 2 (1 7-61 ) 

which says that the gas expands (or compresses) adiabatically along a streamline since 
Eq. (17-61) is the thermodynamic equation for the adiabatic change of an ideal gas, 
i.e., pV y = constant, where y = C p /C v = % for a monatomic ideal gas. 

We can now use this equation and Euler’s equation to derive the equation for the 
propagation of sound waves through a gas. To do this we assume that the gas is in 
equilibrium except for the passage of the sound waves. We can then assume that u 
and all of the space and time derivatives of u, p m , and T are small first-order quantities. 
Neglecting second-order quantities then and setting K = 0, Eqs. (17-56) and (17-57) 
become 

^ = Pm • Vu = 0 (17-62) 

3 “ 

P-ei + Vp = 0 


Multiply the first of these by djdt and the second by div and again neglect second-order 
terms to get (Problem 17-9) 



(17-63) 


Ordinarily the pressure is a function of the density and the temperature, but the 
density and temperature are related in this case by Eq. (17-59), which becomes 


3 dT = T ^Pm 
2 Pm dt dt 


when second-order quantities are neglected. We choose density as the independent 
variable and write 




(1 7-64) 


where in the last step we have assumed that ( dp/dp) s does not depend on the density. 
The subscript S refers to constant entropy since we have no heat flow. From the 
equation pV 5,z — constant, we find that (dp/dp) s = 5p/3p, which is independent of p 
for an ideal monatomic gas. 

Define an adiabatic compressibility by 


_>(« 

Pm \Opfs P \°PJs 

Substitution of Eqs. (17-64) and (17-65) into (17-63) gives 
2 &P 


(17-65) 


(17-66) 


for the spatial and time dependence of the density. This is a wave equation and de- 
scribes the propagation of density waves (sound waves) through a gas with velocity 

c -o dp < "- 67) 
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For an ideal monatomic gas. 




(17-68) 


This shows that the speed of sound is approximately equal to the mean molecular 
velocity of the molecules of the gas. This compares very favorably with experimental 
values. 

We can almost immediately derive Bernoulli’s equation from Euler’s equation. 
Remember that Bernoulli’s principle states that the pressure exerted on a surface 
varies as the inverse square of the velocity of the fluid passing over the surface. Start 
with Euler’s equation (Eq. 17-56) 

+ u • v) u = K - — Vp (1 7-69) 

\Ot ] p m 


Assume we have a steady flow, i.e., that dujdt = 0, and that the external force is con- 
servative, i.e., that K = We can use the vector identity (Problem 17-6) 

iVw 2 = (u • V) u + u x (V x u) (1 7-70) 


to write Eq. (17-69) in the form 

v(i« 2 + — + = u X(V X u) - kTVpm 

\ Pm I m Pm 

Now assume that the fluid is incompressible, which means that the density is constant 
throughout the volume of fluid and throughout its motion. From the equation of 
continuity, Eq. (17-7), we find that the condition of incompressibility can be written 
in the form V • u = 0. Furthermore, assume that V x u = 0. When this condition is 
satisfied, one says that the flow is irrotational. In the modern literature, V x u is 
denoted by curl u, but in the older literature you will find rot u. The quantity V x u is 
called the vorticity vector in hydrodynamics. With these two conditions, we have 

v{±« 2 + y + 'l' j = 0 

or 

\u 2 + — + }]/ = constant (1 7-71 ) 

Pm 

This is Bernoulli’s equation and shows that the pressure is inversely proportional to 
u 2 . In the next section we shall discuss the solution of the Navier-Stokes equation for 
several simple systems. 


17-3 THE NAVIER-STOKES EQUATION AND ITS SOLUTION 

Euler’s equation is the fundamental hydrodynamic equation of motion for non- 
viscous fluid flow. The fundamental hydrodynamic equation describing viscous flow 
is obtained by substitution of the pressure tensor given by Eq. (17-34), 

P = {P + {in ~ *)V * u}l - 2*7 sym(Vn.) 
into the momentum balance equation, Eq. (17-21), 

Pm {jt + U ■ V ) U = P* K ~ V ' P 
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to get the Navier-Stokes equation (Problem 17-7) 


,(!+«• v) u 


= p m K + t]V 


MH 


grad div u — V/> 


(17-72) 


The Navier-Stokes equation is the hydrodynamic equation governing the flow of a 
viscous fluid. 

The Navier-Stokes equation is often seen written in the form (Problem 17-10) 


= V • (O - p m UU) + p m K 


(17-73) 


where the tensor a is called the stress tensor and is given by 
<*= “P 

The stress tensor itself is often written in the form 
<r = — /?l + k(V • u)l + 2 rje 
where e is the divergenceless part of Vu, 
e = sym(Vu) — J(V • u)l 


(17-74) 


(17-75) 


(17-76) 


and is called the rate of shear tensor. 

Since the Navier-Stokes equation plays an important role in the statistical mechani- 
cal theory of viscosity (cf. Section 21-8) and, in addition, is probably unfamiliar 
to most readers, we shall discuss here several of its simpler applications. This section 
is somewhat of a digression since the results here are not used later in the book, but 
nevertheless, many of the standard textbook problems of hydrodynamics are not only 
interesting in themselves, but also serve as excellent exercises in boundary-value 
problems in various coordinate systems. In fact, it is unfortunate that hydrodynamics 
has disappeared from the curriculum of theoretical chemistry and physics since it is a 
rich applied mathematical experience, as the perusal through the references listed in 
the “Additional Reading” would show. 

Let us consider first the simple case of the steady flow of a viscous fluid between 
two parallel plates as shown in Fig. 17-4. Choose the flow to be in the x- direction and 
the perpendicular to the plates to be the j-direction. We consider the parallel plates 
to be infinite in extent, or, in other words, we shall ignore edge effects. In addition, 
we shall assume that the fluid is incompressible, so that we have that V • u = 0. 
The Navier-Stokes equation becomes 


pm ~Di =z ~ Vp + PmK + r}V u 


(17-77) 



Figure 17-4. The velocity profile of plane Poiseuille flow between two parallel infinite planes. 
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Now due to the geometry of this simple system, u y = u z = 0 and dujdx = dujdz = 0 
in the steady state. Thus, Eq. (17-77) becomes 


dy 2 dx 


dp 

- — + pmg = 0 
dy 


(17-78) 


where we have included the possibility of a gravitational force in the j-direction. That 
dpjdx cannot depend upon y can be seen by differentiating the second of these equa- 
tions with respect to x to get 



which shows that dpjdx cannot depend upon y. If we let the gradient of the pressure, 
dp/dx 9 be a constant — y, then the first of Eqs. (17-78) gives 


«*O0 = ~^~y 2 + c i y + c 2 (1 7-79) 

2*7 

where c x and c 2 are two constants of integration. A standard boundary condition in 
viscous flow is to assume that the fluid sticks to the boundary surface, and so in this 
particular case we have w,(0) = u x (b) = 0 and Eq. (17-79) becomes 


u x (y) = ^y(b- y ) 

2*7 


(17-80) 


This velocity profile is shown in Fig. 17-4. Flow of this type is called plane Poiseuille 
flow. Problem 17-27 illustrates plane Couette flow, which is the steady-state flow 
when one of the planes moves past the other with a constant velocity U y and Problem 
17-29 illustrates the flow down an inclined plane. 

The concept of Poiseuille flow should be familiar to most physical chemists since 
the timing of the flow of a viscous liquid down a cylindrical capillary is a standard 
physical chemistry experiment for the determination of viscosity. For this cylindrical 
case, Eqs. (17-78) take the form 


fd 2 u 1 du\ dp 
r, \dr i + 7dr) = 8z 


(17-81) 


where r is the radial distance and z is the distance along the cylinder. The radius of 
the capillary is R . We choose dp/dz to be a constant y = {p x — p 2 )/l , where / is the 
length of the capillary and (p x — p 2 ) is the difference in pressures at the two ends. 
Equation (17-81) is easily integrated if one uses the identity 


d 2 u 1 du Id/ du\ 
dr 2 + r dr r dr \ dr) 


(17-82) 


Two simple integrations give 
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Since w(0) must be finite, c x must be zero, and since u(R) — 0 due to the no-slip 
boundary condition, we have 


w(r) = j-(r 2 -i? 2 ) 
4i/ 


The volume of fluid crossing any normal section of the capillary per unit time is 


2 Pt) 


(17-83) 


which is the standard formula of Poiseuille flow. 

The calculation of the velocity profile for the steady unidirectional flow through a 
rectangular conduit is an instructive problem. If we assume conditions similar to 
those in the previous examples, the Navier-Stokes equation is 



(17-84) 


where y is a constant pressure gradient and u is the velocity along the conduit. The 
geometry is shown in Fig. 17-5. Equation (17-84) is an inhomogeneous partial dif- 
ferential equation. Its solution can be found by first writing 

«(*» y) = We(*> y) + u p( x > JO 0 7 -85) 

where u c (x, y ) is the complementary solution, i.e., the solution to the homogeneous 
equation 



and u p (x , y) is a particular solution to the inhomogeneous equation. Certainly 


u P (x,y) = ^y 2 + c 1 y + c 2 

is a particular solution to Eq. (17-84), and the no-slip boundary condition u p = 0 for 
y = ± c/2 gives 



y 



Figure 17-5. The coordinate system for a rectangular cross section describing steady unidirectional flow 
through a rectangular conduit. 
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Figure 17-6. Velocity contours for the steady unidirectional flow through a rectangular conduit. (From 
H. Rouse, ed., Advanced Fluid Mechanics. New York: Wiley, 1959.) 


The complementary solution can be found by separation of variables to be (Problem 
17-31) 


, \ v a u^ 2n ~ ( 2n - Vfry 

«c(X y) = L A n cosh ~ COS 

n— 1 2c 2c 


where the A„ are given by 

(2 n — 1)7 ib 2 c 2 y f . cc B a B \ 

' < - COSh — 5 ^( 2sm 2-“" COS 2) 


( 17 - 86 ) 


where a B = (2 n — l)7c/2. The velocity contours, u(x, y) are shown in Fig. 17-6 for the 
case b = 2c. 

Although this section has been a rather lengthy digression, it is hoped that this 
brief introduction to fluid mechanics gives the reader some degree of comfort and 
familiarity with the Navier-Stokes equation. 

This concludes our presentation of continuum mechanics. In the next chapter we 
shall derive these same equations from within a molecular framework. In this way, 
we shall obtain expressions for the various transport coefficients in terms of molecular 
properties. We now go on to a main goal of this book, that is, to determine the molec- 
ular basis of the theory of transport. 
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PROBLEMS 

17-1. Prove the vector identities: 

(a) div grad a = V 2 a 

(b) curl grad a = 0 

(c) div curl v = 0 

(d) curl curl v = grad div v — V 2 v 

(e) V • (av) = aV • u 4- u • Va 

(f) V • (v x u) = u • V xv — v*V xu 

(g) V(v *u) = (vxV)xu + (uxV)xv + v(V • u) + u(V • v) 

17-2. Convince yourself that 
D6 d6 

— = — 4- u • V0 
Dt dt 

represents the change in 0 (r, t) along the flow of the fluid, moving with velocity u. 

17-3. Show that the condition of incompressibility can be expressed by div u = 0. 

17-4. Derive the momentum balance equation, Eq. (17-17). 

17-5. Convert Eq. (17-17) into Eq. (17-21). 

17-6. Prove the tensorial identities 

(a) V • (uu) = (u • V)u + u(V • u) 

(b) V * (miu) = a { u • V)u 4- u(V • mi) 

(c) V • (u • B) = u - ( V • B) + B : Vu 

(d) l:Vu = V'U 

(e) V • (V • u)l = V(V • u) 

(f) V • (sym V • u) = JV 2 u + JV(V • u) 

(g) v • Vv = JVv 2 — v x (V x v) 

17-7. Substitute the Newtonian pressure tensor into the momentum balance equation to 
derive the Navier-Stokes equation, 

dll 

Pm — = — V /7 — ($77 — #c)V • (V • u)l + 2 t?V • sym Vu) 

Now use the tensorial identities proved in Problem 17-6 to derive Eq. (17-72). 

17-8. For slow, smooth fluid flow, one often uses the so-called linearized Navier-Stokes 
equation 

du 

pm — = — Vp + 7}V 2 U + ($7) + #C)V(V * U) 
ot 

Discuss the physical assumptions involved in deriving the linearized Navier-Stokes equation 
from the complete equation. We shall use this linearized form in Section 21-8. 

17-9. Derive Eq. (17-63). 

17-10. Show that the Navier-Stokes equation can be written in the form 
d 

-- p m U = V * (0 — Pm Ull) + PmK 
Ot 

where o = — p is called the stress tensor. 

17-11. Show that the stress tensor can be written in the form 

c = —p\ 4- *(V • u ) I 4- 2 rje 
where 

e = sym(Vu) — HV • u)l 
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17-12. Solve the one-dimensional diffusion equation for an infinite region under the 
initial condition c{x 9 1) — 8(jc — jc 0 ). This solution is called the fundamental solution of the 
one-dimensional diffusion equation. 

17-13. A two-dimensional Fourier transform pair has the form 
c(k, t) (2 tt) " 1 1 | e‘* 'c(r, t ) dr 

c(r, t) = (2ir) - * J* J e~‘ kr c(k, /) dk 

where k (fc x , k y ) 9 r = (x, y\ dr dx dy 9 and dk = dk x dk y . Show that if c(r, t) is spherically 
symmetric, then 

-CO Jin 

c(k, t) (2tt) ~ 1 f drrc(r 9 t) I e ,kr cos 6 d6 
* o “^0 

The integral over d6 here is one of the defining equations for the zero-order Bessel function 
Jo 9 



Using this and the fact that / 0 (*) is an even function, show that 
c(k, t)— f rJ 0 (kr)c(r, t) dr 


and 


c(r, t) = kJ 0 (kr)c(k, t) dk 
J o 

Compare this to the three-dimensional case given by Eqs. (12-51) and (12-52). 

17-14. Using the result of the previous problem, find the fundamental solution of the 
two-dimensional diffusion equation. (The necessary integral is standard and can be found, 
for example, in Handbook of Mathematical Functions 9 edited by M. Abramawitz and I. 
Stegun, New York: Dover.) Show that the resulting c(r, t) is normalized. 

17-15. Show that the solution to the one-dimensional diffusion equation for an unbounded 
region with initial condition c(x 9 0) = /( x) is 


c(x 9 1) ~ (AirDt )- 1/2 


r 00 

I du 

— m 


Note that this is simply the convolution of the fundamental solution (the Green’s function) 
and the initial condition. 

17-16. Use the fundamental solution to the three-dimensional diffusion equation [Eq. 
(17-53) with Co = 1] to show that the mean-square displacement of a diffusing particle is 
6Dt in three dimensions. Derive the same result direcdy from the Fourier transform of c(r, t) 
by showing that 

= - wt= - i L wc(r ’ ,)dr 

= -Kr 2 (/)> 


Discuss the implications of this concerning reversibility and irreversibility. 
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17-17. Show that 


dc d 2 c d 2 c 

eT D *w + D ’w + Dl 


d 2 c 

dz 2 


represents anisotropic diffusion and that its fundamental solution is 


dx, y, z, t) 


1 f (x-xc) 2 

8(t rtyi\D x D y D.Y' 2 ® XP L 42), ) 


(y—yoY (z— zo) 2 ~l 
4 Dyt 4 D z t J 


17-18. Show that 



dc dc _ dc d 2 c 

te + P*Ty +fi *¥z +D *'E7 2 + 


Dy 


d 2 c d 2 c 

b D- — 

dy 2 dz 2 


represents anisotropic diffusion with an anisotropic drift and that its fundamental solution is 

, 1 __r (x—Xo +pxt) 2 (y-yo+pyt) 2 

dx, y, Z, o- 8(7Tt)3/2(D xDyDz y >2 ex P[ 4D x t 4 D y t 

(z — z 0 + /3z/) 2 1 
4 Dyt J 

17-19. The Debye theory of rotational relaxation is based upon the rotational diffusion 
equation (see, e.g., A. Carrington and A. D. McLachlan, Introduction to Magnetic Resonance , 
Appendix H) 


dp 

dt 


= D R V 2 p 


where p = p(6, <£, /) is the probability that a molecular axis points in the direction (0, <f>) 9 D R 
is the rotational diffusion coefficient, and V 2 is the angular part of the Laplacian operator 
in spherical coordinates, i.e., 

1 d / . d\ Id 2 
V2 "" tiTO TO ^ Sin 6 dOf + sin 2 0 d</> 2 

Using the fact that the spherical harmonics Yi m (0> <f>) satisfy the equation 

V 2 T Im = -/(/+ i)Y lm 

and assuming that the initial probability distribution is a delta function concentrated at 
0 = $ = 0, i.e., that 

p(Q, <f>, 0) = ^ 8(cos 6—1) 

show that 

f<fl. 4 >, 0 = 2 (^)a(cos 

and that 


<cos 0(/)> = 



dO sin 6 cos 0 p(0 9 <f> 9 1) 


— e~ 2DRt 


and that 

<i[3 cos 2 0(f) — 1]> = e ~ 6 ° Rt 
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17-20. Consider the one-dimensional diffusion equation for the region — oo < x < a 9 
with an absorbing wall located at jc — a. Let the initial condition be simply c(jc, 0) = 8(jc). 
Thus we must solve the partial differential equation 

dc d 2 c 

-r~ = D-— x<a 


subject to the conditions 

c(x 9 0) - 8(x) 
c(o 9 0=0 

This last condition is the mathematical statement of an absorbing barrier. 

First show that the solution to the unbounded problem with initial condition c ( jc , 0) = 
S(x — jc 0 ) is 

c(x, t; xo) = (4t TDt)~ i/2 exp | | 

Clearly c(x, t; 0) satisfies the partial differential equation and the initial condition, but not 
the boundary condition. For jc 0 > a, show that the linear combination 

c(x 9 0 = c( x> t\ 0) + Ac( x, t; x 0 ) 

also satisfies both the partial differential equation and the initial condition. The procedure 
now is to look for an A and x 0 such that the boundary condition is also satisfied. 

Notice that the two terms in this trial solution can be thought of as the concentration 
profiles resulting from initial sources located at x — 0 and x = x 0 . Thus we shall refer to 
them as source terms. The first term is real in the sense that the diffusion does start from the 
point x = 0, but the existence of the absorbing barrier located at jc = a prevents this from 
being a solution to the problem. This first term alone predicts that the concentration at 
x = a would be c(a 9 t; 0) = (47rDt)~ U2 exp(—a 2 /4Dt) instead of zero as required by the 
absorbing barrier. To overcome this, we can place a fictitious source beyond the barrier in 
such a way that the concentration at the absorbing barrier is zero. Such a fictitious source is 
called an image source , and the method being introduced here is called the method of images . 

Clearly, by symmetry the image source should be placed at jc = 2 a and should be done so 
with a negative weight so that it is actually an image sink rather than an image source. Thus 
we see on physical grounds that we should choose jc 0 - 2 a and A — 1 . Show that the com- 
plete solution to the problem (partial differential equation, initial condition, and boundary 
condition) is given by 

c(*. o = (4„Z>,)— {exp(— - exp[— } 

Note that the solution to the problem with a barrier can be written down in terms of the 
fundamental solution c( jc , /; jc 0 ). Sketch this solution, showing how the source term and the 
image sink term contribute to the total solution c ( jc , /). 

17-21. Use the method of images introduced in the previous problem to solve the diffusion 
problem with two absorbing barriers, located at ±a. The problem to be solved is 


dc d 2 c 
dt ^ dx 2 


- a < jc <a 


c(a 9 t ) c(—a 9 1) = 0 

c(x 9 0) = S(x) 


Show that the solution to the two absorbing barrier problems requires a doubly infinite 
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system of images, with a set of “unit sources” located at the points x„' =4na (n = 0, ±1, 
± 2, . . .) and a set of “ unit sinks ** located at the points x n " = 2a{\ — 2 n\ and thus 

17-22. Solve the previous problem by the method of separation of variables. The solution 
is 


“ 1 . /wr\ 2 2 . [wr(* + <01 

;■“( t)'~ " “‘l - s - J 


Note that this appears to be different from the one obtained from the method of images in the 
previous problem. Of course the two solutions are equivalent, and the two series are alter- 
native representations of the same function. 

17-23. Show that the equation 


dc d 2 c dc 

Tt = d ^- (X T x 


represents a system both diffusing and moving with a constant drift velocity [jl. Show how 
Fick’s law is modified for this case. This equation could represent the diffusion of a large 
charged particle such as a protein in a uniform electric field (cf. Section 22-3). 

17-24. Show that the fundamental solution to the equation in the previous problem is 

[ (x — x 0 — uf ) 2 1 

4Dt J 

17-25. Use the method of images introduced in Problem 17-20 and the fundamental 
solution given in the previous problem to solve the problem of diffusion with a bias or a 
drift with an absorbing barrier located at jc — a. In other words, solve the equation 

dc d 2 c dc 

dt dx 2 p dx 


along with the conditions 

c(x, 0) S(x) 
c(a, t) = 0 

17-26. Repeat the previous problem, but this time with two absorbing barriers, located 
at ±a. This is also the solution to Problem 17-21 with a drift. 

17-27. Consider the flow of a fluid between two parallel plates, a distance b apart, where 
one of the plates moves with velocity U relative to the other. Take the velocity to be in the 
^-direction and the perpendicular to the plates to be the ^-direction. Following the deriva- 
tion of Eq. (17-80), find u x {y) for this system. This type of flow is called plane Couette flow. 

17-28. In plane Couette-Poiseuille flow, the geometry is similar to that in the above 
problem, but there is a constant pressure gradient dp/dx in addition to the relative motion 
of the plates. In other words, this problem is a combination of the previous one and the plane 
Poiseuille flow discussed in the chapter. Find u x (y) for this type of flow. 

17-29. If the plates bounding the flow in the previous two problems are parallel but not 
horizontal, the solution is found by the same procedure that led to Eq. (17-80), except that 
if the jc-direction is still taken to be the direction of flow; then dp/dx should be replaced by 
dp/dx — pmg sin and dp/dy by —pmg cos jS, where /9 is the angle that the jc-axis makes 
with the horizontal (cf. Fig. 17-7). Find u x (y) for this case. 



PROBLEMS 401 


y 



Figure 17-7. Geometry of the problem of plane Couette-Poiseuille flow down an inclined plane. 

17-30. Show that Eq. (17-81) is the appropriate form of the Navier-Stokes equation for 
flow through a cylindrical capillary. 

17-31. Show that Eq. (17-86) is the complementary solution to Eq. (17-84). 

17-32. Derive Eq. (17-30). 



CHAPTER 18 


KINETIC THEORY OF 
GASES AND THE 
BOLTZMANN EQUATION 


In Section 7-2 we introduced the concept of phase space and distribution functions in 
phase space. We also derived the Liouville equation, which is the equation of motion 
that the phase space distribution function must satisfy. Since we were interested only in 
equilibrium statistical mechanics at that time, we did not consider the Liouville 
equation in any detail. In this chapter we shall review the concept of phase space and 
derive the Liouville equation again. We shall then introduce reduced distribution 
functions and derive the Bogoliubov, Born, Green, Kirkwood, Yvon (BBGKY) 
hierarchy. This hierarchy is the nonequilibrium generalization of the Kirkwood 
integral equation hierarchy for the fluid distribution functions, gf (n) (r 1 , . . . , r n ), of 
Chapter 13. Nobody has yet devised a successful way to uncouple the BBGKY 
hierarchy, and so in Section 18-4 we shall derive a physical, yet approximate, equation 
for the distribution function for gases. This equation, called the Boltzmann equation, 
is the central equation of the rigorous kinetic theory of gases. In Section 18-5, we shall 
derive some of the general consequences of the Boltzmann equation that can be 
determined without actually solving it completely. We shall discuss its solution in 
Chapter 19. The standard reference for most of this chapter is Hirschfelder, Curtiss, 
and Bird. Mazo (in “Additional Reading ”) also discusses these topics well. 

18-1 PHASE SPACE AND THE LIOUVILLE EQUATION 

Consider a system of N point particles. The classical dynamical state of this system 
is specified by the 3N momentum components p l9 p 2 , ..., p 3N and the 3N spatial 
coordinates g l9 . . . , q 3N . We can construct a 67V-dimensional space whose coordinates 
are q u q 2 , p l9 . . . , p 3N . One point in this phase space completely specifies the 
microscopic dynamical state of our A-particle system. As the system evolves in time, 
this phase point moves through phase space in a manner completely dictated by the 
equations of motion of the system. Actually, one never knows (nor really cares to 
know) the 6N coordinates of a macroscopic system. Rather, one knows just a few 
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macroscopic mechanical properties of the system, such as the energy, volume, velocity, 
etc. Clearly there are a great number of points in phase space that are compatible 
with the few variables that we know about the system. The set of all such phase 
points constitutes an ensemble of systems. The number of systems in an ensemble 
approaches infinity, and so the set of phase points that could possibly represent our 
system becomes quite dense. This allows us to define a density of phase points or 
distribution function as the fraction of phase points contained in the volume dq t dq 2 
* • • dp 3 fl . We shall denote the phase space distribution function by f N (q x , q 2 , 
p 3N , t), or more conveniently by f N (p, q 9 1). We shall often use this abbreviated notation. 
Similarly, we shall often denote dq u dq 2 • • • dp 3N by dp dq. The density f N (p, q, t) is 
normalized such that 

J7w(P, <1, 0 dp dq = 1 

Since each phase point moves in time according to the equations of motion of the 
system it describes,^ itself must obey some sort of equation of motion. The equation 
that f N (p, q, t) satisfies can be readily determined by using the methods of the previous 
chapter, particularly, the argument associated with Eqs. (17-1) to (17-5). The number 
of phase points within some arbitrary volume v is 


n = N J f N (p, q, t) dp dq 


where we are using the condensed notation of letting p and q denote all the spatial 
coordinates and momenta necessary to specify a system in the ensemble. The rate of 
change of the number of phase points within v is 



Since phase points are neither created nor destroyed, the rate of change of n must be 
given by the rate at which phase points flow through the surface enclosing v. The rate 
of flow of phase points is Nf^u, where u is not just the 37V-dimensional vector 
(q u q 2 , . . . , q 3N ), but the 67V-dimensional vector (q l9 . . . , p l9 . . . , p 3iV ) since the spatial 
coordinates and momenta play an equivalent role in phase space. We integrate this 
flow over the surface to get 


dn 

di 


= — N 



udS 


The negative sign here indicates that an outflow of phase points yields a negative 
value for dn/dt since u ■ dS is positive if u is directed outward from v and negative if u 
is directed inward. 

The surface integral can be transformed to a volume integral by using Gauss’ 
theorem to get 


^ = ~ N j V (fN*)dpdq 


(18-2) 


If we subtract Eq. (18-1) from Eq. (18-2) and realize that this equation is valid for any 
choice of v, we have the equation for the conservation of phase points 


^ + V(/ w u) = 0 


(18-3) 
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in which it should be clear that since we are dealing with phase space 
U = (4l> • • • » <Z3Af » Pi* • • • » P3N ) 

and 

3 N d 3W J 

V /„U=i J-Un4j)+ E J-(fnPj) 

j=ioqj- j=iopj 

= il dqj qj dpj Pj \ j = i \dqj dp) N 

But Eq. (7-27) shows that the summand of the second summation here is zero, and so 
Eq. (18-3) becomes 


df H A df N df N . 

-T7 + L T~ + £ J-Pj = 0 

dt j= i dqj ;=i dpj 

Using Hamilton’s equations of motion, 

dH i . dH 

Pi=~Y7 and 3. = JT 
dq t dpi 

Eq. (18-4) can be written 

d A + f ( d JL d AAJL d A\ = 0 

dt 7=1 \dpj dq j dqjdpJ 


(18-4) 


(18-5) 


The summation here is called a Poisson bracket and is commonly denoted by {f N , H}\ 
so Eq. (18-5) is often written as 


^ + {/ n ,7/} = 0 


(18-6) 


This is the Liouville equation, the most fundamental equation of statistical mechanics. 
In fact, it can be shown that the Liouville equation is equivalent to the 6N Hamilton 
equations of motion of the N-body system.* 

In Cartesian coordinates, the Liouville equation reads 


f? + £- 

dt j=inij 


N 


V w +lF r V PJ / w = 0 

j= 1 


( 1 8 - 6 ') 


In this equation V rj denotes the gradient with respect to the spatial variables in f N ; 
V PJ denotes the gradient with respect to the momentum variables in f N ; and F j is the 
total force on the jth particle. 

One often sees the Liouville equation written as 




(18-7) 


where L is the Liouville operator. 


L = 



'•*») 


(18-8) 


* Mazo, “Additional Reading,” p. 23; M. Beran, Amer. J. Phys., 35, p. 242, 1967. 
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The Liouville operator has been defined in such a way as to bring the Liouville equa- 
tion into the form of the Schrodinger equation. A formal, and sometimes useful, solu- 
tion to Eq. (18-7) is 

M P, r, 0 = e~ iu f N ( p, r, 0) (1 8-9) 

Note that the operator exp(— iLt) displaces f N ahead a distance t in time. This operator 
is called the time displacement operator of the system. 


18-2 REDUCED DISTRIBUTION FUNCTIONS 

Once we have the distribution function f N (p, < 7 , t), we may compute the ensemble 
average of any dynamical variable, A(p, q, t), from the equation 

</l(t)> = | A(p, q, t)f N (p, q, t) dp dq (1 8-1 0) 

It turns out that the dynamical variables of interest are functions of either the coordin- 
ates and momenta of just a few particles or can be written as a sum over such functions. 
A familiar example of this is the total intermolecular potential of the system. To a 
good approximation, this can be written as a sum over pair-wise potentials, and so 


< t/ > = Zf fwOr.-. «fj)/w(r 1 ,...,p w> 0*i"-^Pw (18-11) 

We encountered similar integrands when we studied the equilibrium theory of liquids. 
There we integrated over the coordinates of all the particles except i and j and called 
the resulting function of r £ and Tj a radial distribution function. We do the same thing 
here. We define reduced distribution functions / /v <n) (r 1 , . . . , r n , p 1? . . . , p n , t) by 


• • •> r n> Pi. • • •> P„> 0 = ( N J n y / J/w( r i» • • • • P*r. 0 

*fr n+1 • ^ivP„+i -dp N (18-12) 


We shall usually drop the N subscript and furthermore write this simply as / <B) ( r”, p B , t). 
Usually only / (1) and / <2) are necessary, and therefore we want to derive an equation 
for / (1) and / <2) . To do this, write the force F 7 appearing in the Liouville equation as 
the sum of the forces due to the other molecules in the system £,• F ;j - and an external 
force Xj. Then multiply through by N\/(N — n) ! and integrate over dr„ +l 
dr N dp n+l ■ ■ ■ dp N to get (Problem 18-2) 


^ + t ^ + t Xy • V PJ / (B > 

ot j = i mj j= i 

N 1 N r r 

} F ij m Vvjf d r n+i mmm dr N dp m+l •••dp N = 0 (18-13) 


We have used the fact that / vanishes outside the walls of the container and when 
p ; = +oo. The last term in Eq. (18-13) can be broken up into two parts: 


I Fy - V PJ /< B > 


*, J = 


N\ n N r r 

+ _ n y (= Z t J J Fy • V pj /dr n+ 1 • • • dr N dp„ + 1 - dp N 
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The second term here can be written as 


£ JT F J>n + 1 ' V pj / (n+ dr n+t dp n+1 

j= 1 

Putting all this together finally gives an exact equation for / (n) , namely, (Problem 18-3), 

Kr+i zr- v r J (n) + £ Xj • V p J ( "> 


dt j = i ntj 


j=i 


"b ^ F ij " “b ^ V p / (n+1) dr n+1 d Vn+1 =0 (18- 

Ui= l 


14 ) 


This is the so-called Bogoliubov, Bom, Green, Kirkwood, Yvon (BBGKY) hier- 
archy. This is the time-dependent generalization of the hierarchy that we derived 
earlier in the equilibrium theory of fluids. In fact, if one assumes that 


s*-™ 


multiplies Eq. (18-14) through by p f , 1 < i < n, and integrates over all momenta, one 
obtains the equilibrium hierarchy for g {n \r u . . . , r n ) (Problem 18-4). It would seem 
natural at this point to tuncate this hierarchy by some sort of a superposition approxi- 
mation, but so far this approach has not been successful.* We shall end up deriving 
approximate equations for / (1) and / (2) . 

Everything we have done up to now has been independent of density; i.e., it has 
been applicable to any density. Now we shall specialize to systems of dilute gases. 


18-3 FLUXES IN DILUTE GASES 

In a dilute gas, most of the molecules are not interacting with any other molecule 
and are just traveling along between collisions. Because of this, the macroscopic 
properties of a gas depend upon only the singlet distribution function r, p 7 , t). 
The subscript j here denotes the singlet distribution function of species j. This is the 
central distribution function of any theory of transport in dilute gases. In this section 
we shall define a number of averages over f/ l) and derive molecular expressions for 
the important flux quantities in terms of integrals over f/ l) . Since we shall be con- 
cerned only with gases in this and the following sections, we shall drop the superscript 
(1) from here on. We shall also write our equations in velocity space rather than 
momentum space, and so the distribution function of interest becomes fj( r, y j9 t). 
We shall renormalize such that the integral of this distribution function over all 
velocities is the number density of j particles at the point r at time t 9 i.e., 

Pj(r, 0 = jfjr, Vj , t ) dvj (1 8-1 5) 

Furthermore, if Nj is the total number of j molecules in our system, then 

N J = If ^ T ’ Vj ’ dt dy i (1 8-1 6) 

We shall now define a number of important average velocities. \j is the linear 


See, for example, R. G. Mortimer, J. Chem. Phys ., 48, p. 1023, 1968. 
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velocity of a molecule of species j; i.e., it is the velocity with respect to a coordinate 
system fixed in space. The average velocity is given by 


1 r 

▼A. 0 = - V(r, V, , 0 dVj 
Pj J 


( 18 - 17 ) 


and represents the macroscopic flow of species j. The mass average velocity is defined by 


v 0 (r, t) 


Tj m jPj 


(18-18) 


Note that the denominator here is the mass density pjr, t). This velocity is often 
called the flow velocity or stream velocity. The momentum density of the gas is the 
same as if all the molecules were moving with velocity v 0 . The peculiar velocity is 
the velocity of a molecule relative to the flow velocity. The peculiar velocity \j is 


V, = v i “ v o 08-19) 

The average of this peculiar velocity is the diffusion velocity (Problem 18-5). Clearly, 

Y/ = 7 - J ( Vj - v o)/j( r . Vj , 0 dVj (1 8-20) 

Pj J 

It is easy to show that (Problem 18-6) 

LPj m jVj = ° 08-21) 

j 

When we studied the elementary kinetic theory of gases, we saw that the various 
transport coefficients were related to molecular transport of mass, momentum, and 
kinetic energy. Let these molecular properties be designated collectively by ij/j , where 
j refers to the particular species. We now derive expressions for the fluxes of these 
properties. Figure 18-1 shows a surface dS moving with velocity v 0 . The quantity 
n is a unit vector normal to dS 9 and dS = n dS. All the molecules that have velocity 
\j = \j — v 0 and that cross dS in the time interval (t, t + dt) must have been in a 
cylinder of length | \j\ dt and base dS. This cylinder is shown in Fig. 18-1 and has a 
volume (n • V,) dS dt. Since there are fj ds } molecules per unit volume with relative 
velocity \ j9 the number of j molecules that cross dS in dt is given by 

(fj d\j)(n -Yj)dS dt 



Figure 18-1. The cylinder containing all those molecules of species j with velocity V j9 which cross the 
surface dS during the time Interval dt. (From J. O. Hirschfelder, C. F. Curtiss, and R. B. 
Bird, Molecular Theory of Gases and Liquids. New York: Wiley, 1954.) 
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If each molecule carries with it a property \j/j , then the flux of this property is 
iMX n ' V y ) dfj 

and the total flux across this surface is 

total flux = J ij/jfjn ■ \j) d\j = n • J ijjjfj V,- dvj = n • x]/, (1 8-22) 

The vector t|/ 7 - , 

A 'j = j'l'jfjVjA'j (1 8-23) 

is called the flux vector associated with the property ipj . The component of this vector 
in any direction is the transport of the property \}/j in that direction. Let us now 
consider the various examples of \j/j . 

TRANSPORT OF MASS 

In this case, ij/j = nij , and 

'1'j = Vj dVj = Pjtrij \j = \j (1 8-24) 

TRANSPORT OF MOMENTUM 

Here \pj = mj V jX , and 

tyj = m i f VjJj V J dy j = pj m i Vj X Vj (18-25) 

which is the flux of the x-component of momentum relative to v 0 . The flux of momen- 
tum is a pressure, which has components 

(pj) XX = Pj Mj Vjx Vjx 
(Pj) X y = PjMj V jx V jy , etc 
or, in general, 

Pj = PjMjVJ\j (18-26) 

which is the partial pressure tensor of the yth species. 

TRANSPORT OF KINETIC ENERGY 

= iMj V/ 

and 

tyj = y / v j 2 Vjfj ^ = ipj mj V/Vj = q y (1 8-27) 

the heat flux vector of the yth species. 

It should be clear at this point that once we have an expression for //r, v 7 - , t), we 
can calculate all the fluxes and hence all the transport properties of a dilute gas. What 
we need now is f i , or at least an equation that gives as its solution. The only equation 
we have up to now is Eq. (18-14) with n = 1, and it can be seen that this also contains 
f/ 2 \ As we said earlier, nobody has found a successful way to uncouple this system. 
In the next section we shall derive an equation for fj , the Boltzmann equation, which 
is the fundamental equation of the rigorous kinetic theory of gases. 
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18-4 THE BOLTZMANN EQUATION 

In this section we shall derive the Boltzmann equation for fj by a simple, physical 
argument. The gas is assumed to be dilute enough that only two-body interactions are 
ever important. The number of 7-molecules in the phase space volume element dr dvj 
at the point (r, v y ) is given by fj dr dvj . In the absence of collisions in the gas, the 
molecules at the point (r, Vy) at time t move according to the equations of motion of 
the system and arrive at the point (r + y, dt, v f + (l/mffXj dt) at the time (t + dt). 
The quantity Xy is an external force. Because all the points that start out end up at the 
same point (in the absence of collisions), we have that 

// r, \j , 0 = //|r + Vy dt, Vy + ^dt,t + dtj (no collisions) (1 8-28) 

But since collisions occur in the gas, not all those molecules that start out at the point 
(r, v y ) at time t end up at (r + Vy dt, Vy + (l/my)Xy dt) at the time (t + dt). Some mole- 
cules leave this stream because of collisions, and furthermore some molecules enter 
this stream because of collisions. Let the number of 7-molecules lost from the velocity 
range (Vj , Vy + dvj) and the position range (r, r -I- dr) because of collisions with /-mole- 
cules during the time interval (t, t + dt) be ry- } dr d\j dt. Similarly, let the number of 
/•molecules that join the group of molecules that starts at (r, Vy) at time t because of 
collisions with /-molecules be Fy/ +) dr dvj dt. If we now include these collision terms 
in Eq. (18-28), we can write 

fj(r + Vy dt, Vy + my _1 Xy dt, t + dt) dr d\j 

=/}(r, v y , 0 dr dvj + £ 0V +) - r yi (-) ) dr ds } dt (1 8-29) 

£ 

If we expand the left-hand side of the equation, we can get (Problem 18-8) 


f + v v r /y +§-v v / J . = £ (ry + > - r y /->) 


m, 


(18-30) 


The left-hand side of the equation represents the change in fj due to the collisionless 
motion of the molecules, called streaming, and the right-hand side represents the 
change in /y due to collisions. Notice that this equation looks very similar to the 
Liouville equation for /} (1) . 

We now want to find an explicit expression for the collision terms in this equation. 
Let us look at T ji ^ ) dr dsj dt first. Consider a molecule of type j located at r with 
velocity Vy . The probability that this molecule will collide with an /-molecule in the 
time interval dt and with impact parameter in a range db about b is given by the 
following argument. (See Fig. 18-2). If molecule j is considered to be fixed, the /-mole- 
cule approaches it with a relative velocity (v £ — Vy) = g y . If A in Fig. 18-2 is the 
range of the intermolecular potential, any /-molecule within the cylindrical shell 
indicated in the figure will collide with the fixed /-molecule during the time interval dt. 
The probable number of /-molecules within this cylindrical shell is 


2nfi(r, Vj , t)g u bdbdt 

where g { j = |gy|. The total number of collisions that would occur with this one 
fixed y-molecule is 

In dt JJ./;.(r, v ; , t)gtjb db dv t 
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Figure 18-2. Collisions of molecules of type i with one molecule of type j. The distance A is essentially the 
intermolecular distance at which the potential begins to “take hold.” (From J. O. Hirsch- 
felder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Gases and Liquids. New York: 
Wiley, 1954.) 

The probable number of molecules of type j located in the volume element dr about r 
with velocity between Xj and Xj + dx } is f } { r, \j , t) dr dxj . Therefore we write 

T Ji <_) dr dvj dt = 2 n dr dv } dt^ffi, v Js t)f£r, v f , t)g u b db dv t 
and see that 

= 2 n \\fjfi3ijb db dv, (1 8-31 ) 

We should notice at this point that we have assumed that the mean number of 
i-molecules about the fixed 7-molecule is given by the product of f t and \ i.e., we are 
assuming that their positions and velocities are uncorrelated. The assumption, 
known as the molecular chaos assumption or the stosszahlansatz, is not strictly true 
and is the weakest point in the entire derivation. This turns out to be a very important 
point, and we shall discuss this later on when we do the critique on the Boltzmann 
equation. 

We can apply the very same argument that we used to derive Eq. (18-31) 
to the inverse collisions, i.e., those that scatter particles into the point (r + Xj dt, 
Xj + (1 lmj)Xj dt). We can immediately write 

iy +) dr dsj dt = In dr dv/ dt JJ/Xr, v,', t)f/r, v/, t)g t /b' db' dv / (1 8-32) 

where we use primes to indicate those quantities before the collision which will go 
over into b, v £ , and Xj after the collision. They can be computed from b, x t , and Xj by 
solving the collisional equations of motion. Liouville’s theorem says that 

dr dx t dxjQij dt b db = dr dx{ dx/g dt V db' (1 8-33) 

Using this, we can rewrite Eq. (18-32) in the form 

ry +) = In JJ/i(r, Vi, t)fj(r, v/, ^g^b db dv- t (1 8-34) 

= 2nfffi'fj' gi jb db dVi (1 8-35) 

where the primes on the and indicate that the velocity arguments of these func- 
tions are primed. 
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Substituting Eqs. (18-31) and (18-35) into Eq. (18-30) finally gives the Boltzmann 
equation : 


dfj + ij ■ VJj + ^Xj V v /j = 2*111 db dy t 


dt 


m j 


(18-36) 


This is an integrodifferential equation for f/ l) . We have one such equation for each 
component of the gas. Notice that the equations of motion, and hence the intermolec- 
ular potential, enter this equation implicitly in the integrand on the right-hand side. 
The functions f- and // depend upon the postcollisional velocities v/ and v/, which 
depend upon the precollisional velocities v f and v y through the equations of motion 
governing the collision. Notice also that once the right-hand side is integrated over b 
and v £ , the only variables left are r, v y , and t, exactly those variables on which the left- 
hand side depends. 

For several reasons which we shall discuss, the Boltzmann equation is a remarkable 
equation. The derivation we have presented here is the standard derivation and has the 
advantage of being simple and physical. 

The Boltzmann equation is not only an integrodifferential equation, but a nonlinear 
one as well. Solving such an equation is not simple, but it turns out that there exists a 
very elegant and straightforward approximate scheme due to Hilbert, Chapman, and 
Enskog, which we shall present in the next chapter. Before doing this, however, we 
can extract some very interesting consequences from the Boltzmann equation without 
a great deal of work. 


18-5 SOME GENERAL CONSEQUENCES 
OF THE BOLTZMANN EQUATION 

First, we shall discuss the equations of change. The fundamental equations of con- 
tinuum mechanics that we derived in Chapter 17 may be obtained from the Boltzmann 
equation without determining the form of /. If we multiply the Boltzmann equation for 
fi by ipi and integrate over , we obtain 

Ml + '.v./ 1+ ±x l -v./,} < (., 

= 2n£ JJJ HAY/ -fifj)9ijb db dy t dvj (1 8-37) 

Each of the three terms on the left-hand side of Eq. (18-37) can be transformed into 
more convenient forms: 

d , _ Y Wi 

= (18-38) 

J>, r„ fj *, - | J *, »„/, 

O 8 - 39 * 

/*■ tot “ If " *•’ *« - ! f ‘ Si dy ‘ 


(1 8-40) 
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We have assumed that the external force X £ appearing in Eq. (18-37) is independent of 
v £ . Putting these results together, we can write Eq. (18-37) as 




+ V r • - Pi 

i 


#,• 


+ v,- V r ^i + — • V v >i 


8t ' r r,T * ‘ ri \8t 

= 2n ISB Ufi'fi -fifj)9ijb db d\i (hj 




(18-41) 


This is known as Enskog’s general equation of change for the property ^ £ . We shall 
now show that if ^ £ is any of the collisional invariants, i.e., m i9 m £ v £ , or \ m^ 2 , the 
right-hand side of Eq. (18 -41) vanishes when summed over i. To prove this, first 
notice that the integral 


J]J ^ if iff -fJj)Qij b db dy t dvj (1 8-42) 

is equal* to the integral 

JJJ <Pi(ffj ~fifj)dijb' db’ d\ t ' dr/ (1 8-43) 

which is written in terms in inverse collisions (Problem 18-9). This is so because an 
integral over a collision in one direction must be the same as an integral over a collision 
in the opposite direction. This can be seen from Fig. 16-3. Furthermore, from 
Liouville’s theorem and from the dynamics of molecular collisions, we know that 


Qij = g t / 9 b = b\ dsj = ds{ d\/ (1 8-44) 

and so Eq. (18-43) may be written as 

-/// wifi -fifj)9ijb db dr t dyj (1 8-45) 


Since the integrals of Eqs. (18-42) and (18-45) are equal to each other, they must also 
each be equal to one-half the sum of the two. This gives us that (Problem 18-10) 

JJJ U Ufi —fifj)9ijb db dy { dyj 

= i JJJ - 4'i')(fi'f; -man b db dy t dyj (1 8-46) 


Note that if i// £ = m £ , then the integrand equals zero. This is simply because the mass of 
the ith species is conserved in an elastic collision. 

Now if we sum Eq. (18-46) over both i and j and then interchange the dummy 
indices i and j 9 we get 

i Z JJJ Wi - h'Xfi'ff -fifj)9tj b db dy t dyj 

= i Z ffl (t- - Wifi -fifjbu b db dy. t dyj (1 8-47) 


Since these two integrals are equal to each other, each equals one-half of the sum of the 
two, and so we can finally write (Problem 18-11) 


Z JJf Miff -fifj)9ijbdb dy t dyj 

= i Z JJJ (*« + - *i ~ Wifi' -fifj)9ijb db dy t dyj 


(1 8-48) 
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Now \pi + ij/j — \j/i — \p/ vanishes when ^ £ = m i9 m £ v £ , or im £ i; £ 2 , and so we have 
shown that the sum of the right-hand side of Eq. (18-41) over i equals zero. This 
leaves, then, 


j t I (pifo) + v r -E (Pi M - X pS ^ + • v r ^ ^ • v vt *,} = o 


(18-49) 


as the general equation of change. 

If we let ij/i = m j9 Eq. (18-41) becomes 


fa 

dt 


+ V, • (p.Vf) = 0 


(18-50) 


This is the continuity equation for the ith species. If we sum this over i and use 
Eq. (18-18), we get the continuity equation for the entire system: 

dp 

l£ + V r -(pv 0 ) = 0 (18-51) 


If we let ij/i = m £ v £ and sum over i, Eq. (18-49) becomes (Problem 18-12) 


= (18-52) 

where p is the pressure tensor defined in Eq. (18-26). Similarly, if we let i// £ = iw £ t; £ 2 , 
for example, we get the energy balance equation, Eq. (17-31). (See Problem 18-13.) It 
is reassuring, although not surprising, that these equations should come out of the 
Boltzmann equation. The next general consequence we shall derive is not particularly 
obvious and, in fact, is a rather profound result. 

Next we discuss the so-called Boltzmann //-theorem and the equilibrium solution 
to the Boltzmann equation. Consider a one-component system. The Boltzmann 
//-function is defined by 

m = jj/( r > v . 0 l n /( r . v. 0 dr dv (1 8-53) 

Differentiate H(t) with respect to t: 

^ + ^ <,8 - 54) 


The second term here vanishes since 



if the number of particles in the system is conserved. Equation (18-54) becomes then 




(18-55) 


To evaluate this integral, multiply the Boltzmann equation by In / and integrate over 
dr and d\. This gives 

JJ Inf dr dy = - JJ (ln/)v • VJdr dv - jj (In f) ^ • VJdr dv 

+ 2n JJJ ln/{/'/i' -ff}gb dbdvdv t 


(1 8-56) 



414 KINETIC THEORY OF GASES AND THE BOLTZMANN EQUATION 


The subscript 1 in the collision integral here is to distinguish the two colliding mole- 
cules. The first two integrals on the right vanish since we assume as always that / 
vanishes at the walls of the container and as v-» ±oo (Problem 18-14). This leaves 

d -§ = 2tt JJJ ln/{/// -fftgb db dv dv t (1 8-57) 

This integral may be symmetrized by the same method we used to derive Eq. (18-48) to 
give (Problem 18-15) 

™ J /// -ffi)9b db dY dYy (1 8-58) 

Now this integrand is of the form — (x — y) In (x/y). If x > y 9 this function is negative; 
if x < y, it is also negative; and if x = y, it is equal to zero. Therefore, we get the result 
that 


dH ^ 

— <0 (18-59) 

dt 

The definition of H(t) shows that it is bounded, and hence H(t) must approach a 
limit as t -> oo. In this limit, dH/dt = 0, and so we have an equilibrium or steady-state 
situation with 


ffi =ffi 


or 

ln/'+ln// = ln/+ln/i 


(1 8-60) 

(18-61) 


This tells us that In/ is a summational invariant. However, we know that the only 
summational invariants of bimolecular collisions of spherically symmetric molecules 
are the mass, the momentum, and the kinetic energy. Therefore, In / must be a linear 
combination of these three quantities, or 


In f = am + $ • (m\) — 



= am 


m pp 

+ TT — 

2 y 



(1 8-62) 


If we let am + mfi • p/2y be In c, then 


ln/=lnc-^(v-?) 
/(r,v) = cexp[-^(v-?) ] 


(18-63) 


Notice how closely this resembles a Maxwellian distribution. We can determine our 
unknown parameters c, P, and y from the following conditions. First, we have 


P(r, t) = jfdv = c Jexp j d\ 


(18-64) 


Evaluating the integral in Eq. (18-64) gives (Problem 18-16) 


p(r, i) = c 



1/2 


(18-65) 
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Similarly, we have (Problem 18-16) 



Lastly, we define temperature by the usual relation 

\kT = j (v - v 0 ) 2 

which gives (Problem 18-16) 

$kT=$y~ l 

Equations (18-65), (18-66), and (18-67) then give 


( 18 - 66 ) 


( 18 - 67 ) 


( 1 8 - 68 ) 


which is the classical expression for the Maxwellian distribution of velocities. Since p 
and T can be functions of position here, this is more generally a local Maxwellian 
distribution. This might be the distribution in a steady state rather than at equili- 
brium. We see then that the equilibrium solution to the Boltzmann equation is indeed 
the Maxwellian distribution, another reassuring, but nevertheless necessary, result. 

The really interesting consequence of the Boltzmann equation is the //-theorem, 
whose significance we have glossed over. The //-theorem attributes a direction in time 
to the Boltzmann equation since it states that if we start out with some arbitrary 
distribution function, it will relax to the equilibrium (or steady-state) Maxwellian 
distribution. Of course, one can say that this is a necessary feature of any equation 
we derive to describe a gas since we know from the second law of thermodynamics 
that systems tend toward their equilibrium states. But the fact that it comes out of the 
Boltzmann equation was at first severely attacked for the following reasons. 

The equations of motion of classical mechanics. 


m = -grad,- U(r u ...,r N ) l <j<N 


( 1 8 - 69 ) 


are symmetrical in time. If we let t -> —t in Eq (18-69), one gets the same equations 
back again. This means that classical mechanical systems have no preferred direction 
in time; motion in one direction is no more preferred than motion in the opposite 
direction. On the other hand, the //-theorem shows that the Boltzmann equation does 
have a preferred direction. From a purely mechanical point of view, if all the molecules 
move in such a way to make //decrease, there is at least a possible mechanical motion 
where everything is reversed, and if everything is reversed, H must increase. This is 
part of the more general question of how the irreversible processes that we observe in 
nature can be reconciled with the basic reversibility of the underlying mechanical 
equations of motion. The //-theorem is a particular expression of this situation. This 
objection was first raised by Loschmidt. Boltzmann tried to answer this by claiming 
that Eqs. (18-31) and (18-35) should be interpretted as the probability of a collision 
rather than the actual number of collisions, and this means that H(t) does, indeed, 
not always decrease, but that the probability that it decrease is far greater than it 
increase. The further the system is from equilibrium, the more likely H(t) is to decrease. 
Boltzmann’s arguments were not convincing to all of his critics, and it was left to 
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the Ehrenfests and Smoluchowski some years later to clearly explain the statistical 
nature of the Boltzmann equation. They were able to show that on the average, the 
//-function decreases with time to its equilibrium value, but that fluctuations can 
and will always occur. Furthermore, H will almost always remain near its equilibrium 
value once it gets there. We have encountered a somewhat similar situation when 
we discussed entropy in Chapter 2. Notice, in fact, that the equilibrium value of the 
//-function is equal to — S/k for an ideal gas. A decreasing //-function is in some sense 
an increasing entropy. The //-function can actually be computed as a function of 
time by molecular dynamics. In fact, one of the most powerful and interesting applica- 
tions of molecular dynamics calculations is to nonequilibrium systems. Figure 18-3 
shows H(t) calculated by Alder and Wainwright in their pioneering review article 
(in “Additional Reading”). 

If the objection by Loschmidt were not enough, there was also another paradox 
pointed out by Zermelo. This one was at the time more awkward than the time-reversal 
paradox. There is a general theorem in classical mechanics that says that any mechani- 
cal system enclosed in a finite volume will return arbitrarily close to its original state. 
This theorem is called the Poincard recursion theorem, and the time it takes to essen- 
tially return to its original state is called the recurrence time. Such a cycle is called a 
Poincare cycle. Zermelo vehemently pointed out that the //-theorem is at odds with 
the Poincare recursion theorem, since how could H evolve toward an equilibrium 
value and remain there when classical mechanics dictates that the system must even- 
tually retrace itself. Boltzmann was able to meet this objection as well by pointing out 
that for physically interesting systems, the recurrence times are ridiculously large. 
For example, Boltzmann estimated that a system consisting of 10 18 atoms per cubic 
centimeter with an average velocity of 5 x 10 4 cm/sec would reproduce all of its 
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Figure 18-3. A molecular dynamics calculation of the Boltzmann //-function for a hard-sphere system of 
100 particles at a density v/v 0 — 14.14. The //-function is plotted versus the collision 
number C. (From B. J. Alder and T. Wainwright, in Transport Processes in Statistical 
Mechanics , I. Prigogine, ed.. New York: Interscience, 1958.) 
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coordinates within 10“ 7 cm and all of its velocities to within 100 cm/sec in a time of the 
order of 10 1019 years! 

The scientific atmosphere of the time, however, was such that Boltzmann was not 
able to convince all of his critics, and it was finally Smoluchowski who clearly stated 
that the very concept of irreversibility is intimately involved with the length of the 
recurrence time. If one is initially in a state with a very long recurrence time, the 
process will appear to be ireversible. On the other hand, if the recurrence time is short, 
one does not speak about irreversibility. For any sensible physical system, the Poincare 
recurrence times are extremely long. 

The above results lead to the following picture of phase space. The overwhelming 
majority of phase space describes an equilibrium situation. Scattered throughout phase 
space are small regions that describe certain nonequilibrium configurations. Almost 
all of these nonequilibrium regions are completely surrounded by equilibrium regions, 
so that if we do happen to observe a system in a nonequilibrium state, it is almost 
always headed toward an equilibrium state. Once it reaches equilibrium, it will travel 
through various equilibrium states, only occasionally, rarely passing through other 
little regions of nonequilibrium. Such events are what we observe as fluctuations. 
This is so regardless of whether the phase point travels in a forward or backward 
direction, and so this picture is consistent with the classical mechanical requirement 
of time reversibility. Furthermore, the phase point must make its complete circuit 
through phase space before passing near its original position (since trajectories in 
phase space never cross). This length of time is the recurrence time. This would simply 
be observed as another fluctuation from equilibrium. 

These little nonequilibrium regions in phase space vary in extent. The smaller the 
region, the less time the system spends away from equilibrium. It is conceivable, 
however, that some nonequilibrium region of phase space is such that a phase point 
can be trapped there. For example, consider a gas prepared such that the velocities of 
all of its molecules are parallel to each other and perpendicular to the walls of the 
container. If we assume that the walls are perfectly smooth, the molecules would then 
travel back and forth between the walls, remain parallel to each other, and hence 
never collide and come to equilibrium. The phase point describing this gas would be 
confined to some peculiar region of phase space and get hung up there. Clearly 
the //-theorem and the Boltzmann equation itself would not be applicable to such a 
pathological system. The Boltzmann equation tacitly requires that the system be 
sufficiently chaotic. This leaves open the question of how to decide when a system can 
be described by the Boltzmann equation (or statistical mechanics for that matter). All 
that can be safely said is that such a difficulty never seems to arise in practice. In our 
pathological system above, for example, any slight imperfection or roughness in the 
walls of the container would send the molecules off their parallel paths, and the system 
would become sufficiently chaotic in just a few series of collisions. 

There are many long discussions about irreversibility, classical or quantum mechan- 
ics, and statistical mechanics in the literature, and the reader is referred to Tolman, 
Mazo, Huang, Uhlenbeck and Ford, and ter Haar for further discussions. (See 
“Additional Reading.”) 

It is interesting to note that Boltzmann (1844-1906) did his pioneering work at a 
time when atomic theories were not generally accepted. There were essentially two 
schools of scientific philosophy at that time: the atomists, led by Boltzmann himself, 
and the school of energetics, led by Mach, Ostwald, Duhem, and others. Remember 
that this was a prequantum mechanical time, and the inadequacies of classical 
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mechanics were becoming increasingly evident. For example, Boltzmann was never to 
understand why the internal degrees of freedom of molecules could not be successfully 
treated by classical mechanics. The extent of the controversy can be well summarized 
in Boltzmann’s foreword to Part II of his great work, Lectures on Gas Theory . He 
says, "As the first part of Gas Theory was being printed, I had already almost com- 
pleted the present second and last part, in which the more difficult parts of the subject 
were to have been treated. It was just at this time that attacks on the theory of gases 
began to increase. I am convinced that these attacks are merely based on a misunder- 
standing, and that the role of gas theory in science has not yet been played out. . . 

“ In my opinion it would be a great tragedy for science if the theory of gases were 
temporarily thrown into oblivion because of a momentary hostile attitude toward it, 
as was for example the wave theory because of Newton’s authority. 

“ I am conscious of being only an individual struggling weakly against the stream of 
time. But it still remains in my power to contribute in such a way that, when the theory 
of gases is again revived, not too much will have to be rediscovered. Thus in this book 
[this Part] I will now include the parts that are the most difficult and most subject to 
misunderstanding and give (at least in outline) the most easily understood exposition 

of them ” Boltzmann was clearly pessimistic about the future of the kinetic theory. 

This led to severe fits of depression, ending with his suicide in 1906. 

The Boltzmann equation is accepted today as giving a completely adequate des- 
cription of the behavior of dilute gases. In the next chapter we shall study some of the 
numerical results of the Boltzmann equation. 
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PROBLEMS 

18-1. Why is the operator exp(— iLt) called the time displacement operator? 

18-2 Derive Eq. (18-13). 

18-3. Derive the BBGKY hierarchy for the reduced distribution function f (n \ 
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18-4. Derive the Born-Green-Yvon-Kirkwood hierachy for the equilibrium w-partide 
correlation function g in} (r! , r 2 , . .., r„), Eq. (13-39), from the BBGKY hierachy for 
/ (n) (Pi , . . . , p„ , Ti , . . . , r„), Eq. (18-14). 

18-5. Why is the average of the peculiar velocity called the diffusion velotity [cf. Eq. 
(18-20)]? 

18-6. Prove that 


2 PjMjVj - 0 


18-7. Calculate both p and q given that /( r, v, t) is a local Maxwell distribution 


T ™ 1 3/2 

'-rt-'AMMTTA “ p 


m[y - v 0 (r, t)] 2 \ 
2kT(r, t) I 


18-8. Derive Eq. (18-30). 

18-9. Show that Eq. (18-42) is equal to Eq. (18-43). 

18-10. Prove Eq. (18-46). 

18-11. Prove Eq. (18-48). 

18-12. Derive the momentum balance equation, Eq. (18-52), from Eq. (18-49). 

18-13. Derive the energy balance equation from Eq. (18-49). 

18-14. Show that the first two integrals on the right-hand side of Eq. (18-56) vanish. 
18-15. Derive Eq. (18-58). 

18-16. Prove that the Boltzmann //-theorem implies that the steady-state solution to the 
Boltzmann equation is a local Maxwellian distribution, i.e., prove Eqs. (18-65), (18-66), 
and (18-67). 

18-17. Show that if a local Maxwellian distribution is used to calculate the averages in 
the equations of change, Eqs. (18-51) and (18-52), one gets the ideal hydrodynamic equations. 

18-18. The collision term in the Boltzmann equation is often written approximately as 


/o) 

where / 0 is a local Maxwellian distribution. Show that this implies that the difference of the 
distribution function from equilibrium decays as a simple exponential due to collisions. 
Interpret r. Give an estimate of its magnitude. Should t depend upon the velocity v? 



The standard method of solution to the Boltzmann equation, called the Chapman-Enskog 
method, is fairly long and involved. It is possible, however, to present an illuminating preview 
of the method by solving a simplified form of the Boltzmann equation in which the collision 
term is approximated by — (/— / 0 )/r. This will be discussed in Problems 18-19 through 
18-21. 

18-19. We start with the approximate form of the Boltzmann equation (cf. Problem 18-18) 


df 

^+w r /= 


f-fo 


where the relaxation time r may depend upon v and / 0 is a local Maxwellian distribution. 
We assume that the deviation of / from / 0 is linear in the velocity gradient and the tempera- 
ture gradient. Show that under this assumption we can replace / by f 0 in the left-hand side 
of the equation to get 
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Remember now that the temporal and spatial dependence of / 0 is given implicitly through 
the temporal and spatial behavior of p( r, /), v 0 (r, t), and T(r, t). Using the chain rule for 
df 0 /dt 9 namely, 

Qfo_^Qfodp avo dfo 8T 

~dt~~dp~dt+ v -'° ’lT + 5r a* 

and the ideal hydrodynamic equations (cf. Problem 18-17), show that the above Boltzmann 
equation becomes 

foKfV 2 - |)V • V In T+ b: Vv 0 ] = - 

T 

where W is a reduced velocity 



and 


b = 2(WW - \W 2 \) 

Except for the fact that we have approximated the collision term, this equation is the starting 
point for the second step in the Chapman-Enskog method. 

18-20. Because of the simple approximation for the collision term used .in the previous 
problem, we can solve the final equation immediately for / to get 

/= / 0 [1 - t{W 2 - f)V • V In T- rb : Vv 0 ] 

We can now use this expression for / to calculate p and q. We have already seen in Problem 

18-7 that p (0) = pkT\ and q (0) = 0. In this problem we shall calculate q, and in the next 
we shall calculate p. Show that 

m r 

q = — - J V 2 \f 0 r(W 2 - $)\dv • V In T 
and that 

liTtn r 00 

A = — J o f*TV*(W 2 -$)dV 

We wish to include the possibility that r depends upon V. If we assume that 
r =AV* 


show that 

A - M -w) i7r w+,)l 

Now let r IjV. If the mean free path / is taken to be equal to 1/V27rpcr 2 , then show that 



where C v is the molar heat capacity and TVo is Avogadro’s number. Compare this to Eqs. 
(16-32) and (16-37). 
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18-21. Using Eq. (18-25) and the expression for / given in the previous problem, show that 
Pu =ph,j -2m J f /o rV, Vj(W k W, - ifV 2 B u ) d\ ^ 

k.t=x,y.zJ vl 


or 


p 

where 


= [p + (S’? — *)V * Vo]l — 2r} sym(Vvo) 


■q =2mjf 0 TX\W,iVjd\ 

= 2m j f 0 rV, Vj W, (VjdV 
4nm 2 f a 

f 0 rV e dV 
J o 


15kT 


and 


= 0 


If we again assume that r — A V s 9 show that 


AirAnr 


V = 


15kT 

(2kT\ 


\2mkT] J 0 


6+s e -mv 2 /2kT jy 


112 r[i(7+^)] 

r(7/2) 


Show that 

4 /2\ 312 (mkT) 1 ' 2 

V 15V2 W ° 2 


if t = // v where /= 1/V27T per 2 . Compare this result to Eqs. (16-31) and (16-36). 

18-22. Consider a uniform gas with no external forces, so that the distribution function 
will depend only upon v and t and not r. Divide velocity space into equal finite cells coj and 
let fj(t) be the number of molecules whose velocity vectors end at time tin a>j. Show that the 
Boltzmann equation can be written in the form 

2 (««"/„/« - 
at j,k, l 

where the number of collisions per unit time of molecules from cell o>< with those from cell 
wj, which yield molecules in cells w k and w t , is arf 1 f t fj . Show that the number of collisions 
that yields a molecule in o> 4 is given by 

2 «*■"/*/. 

J.k.l 

A common and fundamental assumption is that of microscopic reversibility, which says that 

Interpret this physically. 

18-23. The simplified Boltzmann equation of the preceding problem yields a more trans- 
parent proof of the //-theorem than the full Boltzmann equation. Define the //-function by 

7f=2/iln/i 
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and show that 


dt i dt 


ln/i + 


d A 

dt 



= 2 In /,(/*/, -/,/,) 

tj.u 

= i 2 fli/‘(ln/« + In fjXfkf, - f,fj ) 

IJkl 

= i 2 «</‘Gn/./, - In fkfdifkfi 

IJkl 

<0 


Be sure to point out the symmetry property of the a u kl that is used in each line. 
18-24. One often sees the Boltzmann equation written in the form 

j- t + Y • v r / + 1 • V y / JdvJ d£lgl( 0 , ex/'fi' -//.) 


where d£l = sin 6 dO d<j> is the differential solid angle and l(g, 6 ) is the differential collision 
cross section for a collision, defined through 

bdbd<f> = I(g, 6 )dO. 

Derive this form of the Boltzmann equation. 

18-25. Show that the collision integral in the Boltzmann equation, i.e., the right-hand 
side of the Boltzmann equation, vanishes if /( r, v, t) is a local Maxwellian distribution. 

18-26. Show that the Maxwellian distribution of velocities satisfies the Boltzmann equa- 
tion identically. Is this also true for a local Maxwellian distribution? 

18-27. In Section 7-2 we proved the property of conservation of extension in phase space. 
We mentioned there that this could be proved directly from the equations of motion, which is 
the content of this problem. 

Let the natural motion in phase space map the set of phase points (p 0 , q 0 ) into ( p t , q t )- 
The property of conservation of extension in phase space can be written as 


[••• f dpt (0) 4p 2 (0) • • • dq„ (0) = [••• fdpt (t)dp 2 (t)"-dg„(t) 

JA(0) J J A(t) J 

where A(0) is some volume in phase space taken at some initial time and A(/) is the result of 
A(0) under the mapping. Show that to prove this property or theorem, one must prove that 
the Jacobian of the (p 0 , q 0 ) and (p t , q t ) is independent of time, or, that 

j d(p t , q t ) 

&(Po » qo) 

is independent of time. Now prove that this is so by proving that dJ'dt = 0 with the use of 
Hamilton’s equations of motion. (See Mazo, in “Additional Reading,” Section 2.4.) 

A number of simple models have been introduced into the statistical mechanical literature 
that are meant to elucidate the concepts of irreversibility, recurrence times, and to try to 
probe the basic assumptions underlying the Boltzmann equation, particularly the so-called 
Stosszahlansatz assumption of molecular chaos, or Eq. (18-31). The next two problems will 
introduce two of these models. 



PROBLEMS 


423 



Figure 18-4. A typical result of the Ehrenfest dog-flea experiment. 


18-28. This illuminating model is due to the Ehrenfests and is called the Ehrenfest dog- 
flea model. Suppose we have two dogs and 2 R fleas, numbered from 1 to 2 R. Initially let all 
2 R fleas be on one dog. Then draw a number from 1 to 2 R and move the flea with that number 
onto the other dog. Let this process be continued and let N A + N B — 2 R and N A — N B — 2k , 
where N A and N B are the number of fleas on the two dogs. If we draw a number and transfer 
a flea every r seconds, then t = sr, where s is the number of draws. A typical result of such 
an experiment is shown in Fig. 18 - 4 . 

Show that at equilibrium, i.e., after a suitable number of draws, that the probability 
distribution of k is given by 

( 2 *)! 

and that for large R and small k that 
W(k)x(7TRy 1,2 e- k2,R 

This corresponds to the Maxwell distribution. 

A valuable feature of this problem is that the “dynamics” can be solved exactly. Let the 
“state” of the system be designated by an integer varying from — R to R. Let P(n, m; s) 
be the probability that the system be in the state m given that it was in the state n s draws 
earlier, and let Q(n, m) = P(w, m; 1) be the transition probability. Show that 

P(n , m ; s) = 2 P(n , k; s— 1 )Q(k, m ) 

k 

= P(n, nr, s - 1 ) + J,P(n, k;s— l)Q(k, m) —P(n, m; s - 1 ) 2 Q(m> l) 

k (*m 

where of course 2"» Q(k, m) = 1. Compare this equation with the Boltzmann equation. An 
equation like this one, which describes how a probability distribution changes with time, is 
called a master equation . 

Show that the transition probability Q for the Ehrenfest dog-flea model is given by 

R + k ~ R — k ^ 

Q (k . kl = — + - jjj- 8* + ».»- 


and that the master equation becomes 
R + m+ 1 


P(n, m\ s) = 


R — m+1 

P(n,m + l;s-l)-\ — P(n,m—\;s—\) 


2 R ' ' 2 R 

Now show that if one starts in the state n, i.e., if P(n, m\ 0) = S„ then 
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which in the limit R ->■ oo, 1 Rt = y, st — t gives 
m{t) = ne~ yt 

Note that the average value of m decreases exponentially to its equilibrium value of zero. 
Thus on the average the system appears to decay irreversibly to its equilibrium state. 

This model can be analyzed in great detail, particularly with respect to Poincare’s theorem. 
Let j P'(n 9 rn\ s) be the probability that the system is in the state m for the first time after s 
draws given that it was in the state n initially. In particular, P'(n, n; s) is the probability that 
the recurrence time of state n is t = st. One can show that (cf. M. Kac, in Selected Papers in 
Noise and Stochastic Processes , edited by N. Wax, New York: Dover, 1954) 

J,P'(n 9 w; s) = 1 

s= 1 

which says that each state of the system is bound to recur with probability 1. This is the analog 
of Poincare’s theorem. One can also show that the mean recurrence time 


Tr = 2 srP(n 9 n\ s) 

3 = 1 


is equal to 


T k = t 


(R +«)!(/? — n)! 
(2R)! 


2 2R 


This is the analog of the Poincare recurrence time. The proof of the last few results is some- 
what involved, but is given in the article by Kac referred to above. 

If R -f n and R — n differ considerably, Tr is extremely large. For example, show that if 
R = 10,000, n = 10,000, and r — 1 sec, then 

Tr = 2 20 * 000 sec « 10 6 000 yr. 


Conversely, if R -j- n and R — n are comparable, T R is quite short. Show that, for example, 
if we set n — 0 in the previous calculation 


Tr tz 175 sec 


It was Smoluchowski who clearly elucidated that if one starts in a state with a long recur- 
rence time, the process will appear to be irreversible. For example, if one dog has 20,000 
fleas and the other zero, then one would observe what would appear to be an irreversible 
flow of fleas from one dog to the other. On the other hand, if the recurrence is small, it makes 
no sense to speak about irreversibility. Thus we see that the concept of irreversibility itself 
is intimately tied up with the magnitude of the Poincare recurrence time. 

18-29. Another interesting model is discussed by M. Dresden in Vol. 1 of Studies in 
Statistical Mechanics (“Additional Reading”) and is called by him the Kac Ring Model 
(Mark 1) after its originator, Mark Kac. Consider a circle with n equidistant points on the 
circumference. Let a set S of m points be permanently marked. We consider n and m to be 
large numbers, but that m<n . Now place a ball on each of the n points on the circle. These 
balls can be either black or white, there, of course, being n balls in all. This specifies what 
we may consider to be the initial state of the system. We now specify its “ dynamics.” During 
each elementary time interval, say 1 sec, all the balls move one step counterclockwise, with 
the rule that when a ball leaves one of the m marked points on the circle, it will change its 
color (white to black or black to white). We wish to determine the color distribution after a 
large number of steps given some initial distribution. Let N w (t) and N W {S 9 t) be the total 
number of white balls and the number of white balls in the set S (sitting on the m marked 
points), respectively, after t sec. Let N B (t) and N B (S, t ) be the corresponding quantities for 
black balls. Clearly we have the relations 

NvM + NM =n 
N w (S 9 t) + N b (S 9 t)-m 
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In addition, we define a quantity T(t) through 

n 

Show that the “equations of motion ” of the system are 

N w (t + 1) = N w (t) + N b (S, t) - N W (S , 0 
N B {t + 1) = N B (t) + N W (S, t) - N b (S , 0 

These can be thought of as corresponding to the Liouville equation. 

We now must resort to some approximation to proceed further, for otherwise these equa- 
tions are simply formal statements. We make the apparently reasonable assumption that 

N w (S,t) = -N w (t) 
rt 

N B (S 9 t) = -N B (t ) 
n 

These equations are analogous to the “ stosszahlansatz.” Substitute these into the above 
“Liouville equation’* to derive the “Boltzmann equation” for this system. Show that the 
solution to these two equations is 

T(t + 1) = (1 - 2^)r(/) 

where p ^ 1 is equal to m f n. Solve this by iteration to get 

T(/) = (1 - 2^) f r(0) 

This represents a monotonic approach to the equilibrium state — the number of white balls 
being the same as the number of black balls, independent of the initial distribution. This 
corresponds to the “Maxwell distribution” of our system. Note that this result is quite 
independent of the nature of the set 5. 

This result is incorrect! Clearly the model is strictly periodic with period 2 n (prove this). 
Hence this system has a Poincare recurrence time of 2/7, so that T(/) = + 2/7), which is a 

violent conflict with the above expression for ). The only assumption that is made is the 
above “ stosszahlansatz,” and so this apparently reasonable assumption has somehow intro- 
duced irreversibility into the problem. Dresden also discusses how this model has an analog 
to the time reversibility objection as well. We refer the reader to Dresden’s article for a 
thorough discussion of these apparent paradoxes. 
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TRANSPORT PROCESSES 
IN DILUTE GASES 


In the previous chapter we have derived and discussed the Boltzmann equation, which 
is the fundamental equation of the kinetic theory of dilute gases. The standard method 
for solving this integrodifferential equation is due to Chapman and Enskog and is now 
called the Chapman-Enskog method. This method is quite long and involved and so 
will be presented here only in outline. Chapman and Cowling as well as Hirschfelder, 
Curtiss, and Bird have detailed discussions of the Chapman-Enskog method (in 
“Additional Reading ”). 

In Section 19-1 we outline the Chapman-Enskog method; in Section 19-2 we present 
and summarize the formulas for the transport-properties of dilute gases and then make 
a detailed comparison of these formulas to experimental data in Section 19-3. Lastly, 
in Section 19-4, we discuss some extensions of the Boltzmann equation. In particular, 
we discuss the Enskog theory of dense hard-sphere fluid, which results from a simple, 
physical modification of the Boltzmann equation. 

19-1 OUTLINE OF THE CHAPMAN-ENSKOG METHOD 

The types of solutions obtained by the Chapman-Enskog method are a very special 
class of solutions, called normal solutions, in which the spatial and time dependence of 
/ (1) (r, v, t) appear implicitly through the local density, flow velocity, and temperature. 
These solutions describe the final stage of the relaxation of a dilute gas to its equilib- 
rium state. This final stage is called the hydrodynamic stage. The Chapman-Enskog 
method expands / in a series of the form (dropping the superscript on / (1) ) 

/ = ^/ C01 +/ C,1 + ^/ C21 + - (19-D 

where f is an ordering parameter and not a smallness parameter. It is set equal to 
unity later on. 

If we substitute Eq. (19-1) into the Boltzmann equation, we get a set of equations 
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for the Z 01 , which we then solve successively and uniquely (Problem 19-1). The 
equation for / t01 is easy to solve and gives (Problem 19-2) 


/ I0] (r, v, 0 = p(r. 




m[y- v 0 (r, t)] 2 
2kT(r, t) , 


(19-2) 


The/ 01 are chosen such that the equations of change that result from the Boltzmann 
equation keep the same form for increasing orders of /, but that the values of the 
parameters that appear in them, namely, p and q, do depend upon the order of /. 
In particular, since 


p = m J VV/ d\ 

and (1 9-3) 

q= iJ V2yfdv 

Equation (19-2) gives p (0) = pkT\ and q (0) =0 (Problems 19-6 and 19-7). These ex- 
pressions for p (0) and q (0) give the ideal hydrodynamic equations when substituted 
into the equations of change, i.e., Eqs. (18-51), (18-52), and (17-31). These equations 
specify p, v 0 , and T, and hence / £01 itself through Eq. (19-2). 

The equation for / m is quite a bit more complicated. If we let / £11 = (fif 101 , then <j> 
is given by the linear inhomogeneous integral equation (Problem 19-3): 


2n JJ/ t0] / [0r (4>' + &'-*- 4>i)gb db ds, = -p 2 m 


d fl0i 

~dT 


+ v 


V r / 101 + - X 
m 


V v / £0] 


(19-4) 

It turns out that the existence condition for this equation for (f> is just the ideal hydro- 
dynamic equations. Since / £01 is a normal solution, given by Eq. (19-2), the time de- 
pendence is given implicitly through p, v 0 , and T. So we write 

f foi . gy o # [01 8T 
dt dp dt V ° J dt dT dt 

and then use the equations of change for the time derivatives of p, v 0 , and T. This 
eventually gives (Problem 19-4) 

p 2 I(<j)) = -/l°)[(^ 2 - |)V V In T+ b : Vv 0 ] (19-5) 

where W is a reduced velocity. 



and 

b = 2(WW - %W 2 \) (19-7) 

This suggests looking for a solution <l> of the form 
l /2kT \ 1/2 1 

<£ = - - ( ) A(W ) W • V In T - - B(W )( WW - 1) : Vv 0 

p\m ) p 


(19-8) 
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where A(W) and B(W) are unknown scalar functions of W. Since /(<£) is a linear 
integral operator (Problem 19-5), we get two separate integral equations, one for 
A(W) and one for B(W). 

Furthermore, $ can be used to calculate the next approximation to p and q by 
means of Eqs. (19-3). One eventually finds that (cf. Hirschfelder, Curtiss, and Bird, in 
“Additional Reading”) 


.a) = _ _ 


2 k 2 T 


3 mp 


vr J A(W)W 2 (W 2 - I)/ 101 dv 


and 


(19-9) 


pd) = _ [ sym Vv 0 - KV • v 0 )l] J / [01 £(HO(WW - $W 2 1) : (WW - }W 2 I) dv 


5 P 


(19-10) 


Thus we know q (1) and p (1) if we solve the integral equations for A(W) and B(W). 
By comparing q (0) + q (1) to Fourier’s law, we get 


X = - j ^ jA(W)W 2 (W 2 - i)/ 101 dv (19-11) 

for the thermal conductivity. Similarly, by comparing p (0) + p (1) to the Newtonian 
pressure tensor [Eq. (17-34)], we find that 


n = ~Y J/ IO] B(B0(WW - i^ 2 l) : (WW - £W 2 I) dv 


5 P 


(19-12) 


Equations (19-9) and (19-10) show that / [01 + / m gives the Navier-Stokes equations. 
The A(W) and B{W) are found by expanding them in truncated series of orthogonal 
polynomials (Sonine polynomials). This is a standard method to solve integral equa- 
tions (Problems 19-46) and 19-47). The expressions we find for A(W) and B(W) 
depend upon the number of terms we include in their expansions in Sonine polyno- 
mials. Usually only one or two terms is needed. If we substitute two-term expansions 
into Eqs. (19-11) and (19-12), we get 


A 75k 2 r r 1 | t 

8 m Un 0,1^22 - a 12 2 
5k7Tl ] 

n 2 U„ 6,1*22 -*i2 2 J 


(19-13) 

( 19 - 14 ) 


The a u and b tj appearing in these equations are complicated integrals over the dy- 
namics of bimolecular collisions and hence contain the information about the inter- 
molecular potential. All of these quantities can be reduced to linear combinations of 
a set of collision integrals, For collisions between molecules of type /and type j, 

these integrals are defined by 

( ?7 rkT\ 1/2 /.°° r 00 

—) f f e-*‘Sy u 2 ’ + \l - cos' X )b db dy tJ (19-15) 

Pij / J o J o 

In these integrals, is the reduced mass; g xj is the relative velocity; x = 9ij) is 
the deflection angle; and y u is a reduced relative velocity, y fj - = (/z 0 /2A:T) 1/2 g ii . These 



OUTLINE OF THE CHAPMAN-ENSKOG METHOD 429 


integrals must be evaluated numerically, and we shall discuss them in the next section. 
We simply list some results from Chapman and Cowling: 

a,, =4Q (2,2) 

a, 2 = 7Q (2 - 2 > - 2fi< 2 - 3 > 

a 22 = TL Q(2,2) _ 7 Q(2,3) + tf2.4) 

b n = 4fi (2,2) 

b l2 = 7Q (2,2) - 2ft (2,3) 

b 22 = fl< 2 ’ 2 > - 7n< 2 3> + Q (2,4> (1 9-1 6) 


Equations (19-13) through (19-16) then give X and t] in terms of the intermolecular 
potential. In the next section we shall study X and rj as a function of intermolecular 
potential and temperature as we did for the second virial coefficient in Chapter 12. 

In order to study diffusion, it is necessary to consider multicomponent systems. 
The main results are that the one-term expansion for the binary diffusion coefficient 
is given by 


3 kT 

\6pn i2 n 12 (1,1) 


(19-17) 


Furthermore, we discovered the possibility of diffusion due to a temperature gradient 
as well. This phenomenon is known as thermal diffusion and was unobserved experi- 
mentally and unknown theoretically before being discovered by Enskog. The co- 
efficient associated with this phenomenon is called the thermal diffusion coefficient 
£>! T . Usually, however, it is the thermal diffusion ratio k T , 




Pm D* 

p 2 m pn 2 D i 2 


(1 9-1 8 ) 


that is discussed, both experimentally and theoretically. 

These results are all for the first two terms in Eq. (19-1). One could go on to cal- 
culate f i2 \ but this becomes even more tedious. It is interesting to note that if this is 
done, one gets hydrodynamic equations, which are more general than the Navier- 
Stokes equations. These more general equations are called the Burnett equations. 
They contain higher derivatives of the thermodynamic quantities and powers of lower 
derivatives. This makes these equations very difficult to use in practice. Nevertheless, 
it clearly shows that there is not one set of hydrodynamic equations but several, each 
set offering increasing detail and precision. The Burnett equations have been applied 
to shock waves in gases, but the agreement between experiment and theory is not good. 
This is probably because the entire Chapman-Enskog method is a development in 
stages away from a local Maxwellian velocity distribution. One must expect the results 
to be most applicable when the system is near equilibrium. Just how near is hard 
to say. 

Grad has developed quite a different method to solve the Boltzmann equation. This 
method gives the ideal hydrodynamic equations in the zero approximation and gives 
the Navier-Stokes equations in the first approximation, but in the next approximation 
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gives something quite different from the Burnett equations. Grad’s method is discussed 
in Section 7-5 in Hirschfelder, Curtiss, and Bird and in the two books by Grad 
himself (in “Additional Reading”). 

We are now ready to discuss the results of the Chapman-Enskog solution. 


19-2 SUMMARY OF FORMULAS 

We have seen that all of our results can be expressed in terms of the collision 
integrals Q 0 c,,s) . These are given by 



(19-1 9) 

where 


Q (,) (7ij) = 27r f (1 - cos' y)b db 

J o 

(19-20) 

and 


yif), y„) - it - 21> J „ 

L r 2 fc7VJ 

(19-21) 

In these equations, y 2 = pig 2 /2JcT; pi is the reduced mass; and r m is the distance of 
closest approach of the colliding molecules. 

For hard spheres of diameter a, Eqs. (19-19) and (19-20) become (Problem 19-13) 

n «. s , _/*n i/ 2 (J+ 1 ) V> 

^.gsph y 2n ^J 2 Urigsph 

(19-22) 

with 


j 1+; ]”” 2 

(19-23) 

Note that these quantities are independent of y for hard spheres. Using Eqs. (19-13), 
(19-14), (19-16), and (19-17), we can write for hard spheres (Problem 19-14): 

75 k 2 T 25 C u lnkT\ 1 ' 2 1 

8 man 32 N \ m J na 2 

(19-24) 

_ 5 kT 5 (tt mkT) 1 ' 2 

n 2 b tl 16 no 2 

(19-25) 

3 kT 3 /nkT\ 112 1 

8 mp 8 \ m ) no 2 p 

(19-26) 


To get these equations, remember that pi = mj2 when the two colliding atoms are 
identical. These equations are exactly those presented in Chapter 16 as the rigorous 
kinetic theory expressions. The advantage here, however, is that these are special 
cases of a more general theory that includes any realistic intermolecular potential. 

For most intermolecular potential functions, we can write the collision integrals 
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given by Eqs. (19-19) to (19-21) in a reduced form by introducing reduced variables. 
If the intermolecular potential can be written in the form 


then we can introduce reduced variables: 


(19-27) 


r* 


r 

o 


b* 


b 

G 


U* 


U 

e 


T* 


fcT 

e 


,*2 


1 M0 2 

2 e 


(19-28) 


Just as we reduced the second virial coefficient by its hard-sphere value, we reduce 
Q (n (g) and Q (, » s) (T) by dividing them by their hard-sphere values. Thus 


Q (ir (T*) 

Q (»,*)*( T *) 



Q (,,s) 


O (l,s) 
^rig sph 


(19-29) 

(19-30) 


These reduced quantities physically represent the deviation of any particular molecular 
model from the hard-sphere model. In terms of these reduced quantities, we have 
(Problem 19-16) 


25 C„ /nkT\ il2 1 
32 N \ m ) 7t«r 2 Q (2,2) * 

5 (n mkT) m 
no 2 fi< 2 - 2 >* 

3 (nkT\ 112 1 

pno 2 VS 1 -"' 


(19-31) 

(19-32) 

(19-33) 


These are the generalizations of Eqs. (19-24) to (19-26). They reduce to the hard- 
sphere results where Q iltS) * = 1. 

These formulas are the result of using simply a one-term expansion for A(W ) and 
B(W ) in Eqs. (19-11) and (19-12). Equations (19-31) and (19-32) are, in fact, the first 
terms in Eqs. (19-13) and (19-14). Such one-term expansions turn out to be satisfac- 
tory for most transport coefficients (the thermal diffusion ratio is a notable exception). 
It is common to write the expressions for these transport coefficients in the form 

. 25 C. (nkT\ l/2 f x 

32 N \ m / 7rcr 2 Q (2,2) * 


(1 9-34) 
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for example, where f x is a factor close to unity that corrects A to a more precise value. 
Equation (19-13) shows that f x is 


/* = 1 + 


a 


12 


a \ l a 22 ~ a 


(19-35) 


12 


The correction factors and f D , etc., are given in Appendix 8 A of Hirschfelder, 

Curtiss, and Bird (in “Additional Reading”).* Table 19-1 gives these factors as a 
function of temperature for the Lennard- Jones 6-12 potential. It clearly shows that 
these higher corrections are small. This substantiates our contention that the integral 
equations for A(W) and B(W) can be adequately solved by using just a one-term 
expansion in Sonine polynomials. 


Table 19-1. The effect of higher approximation of the thermal transport coefficients for a Lennard- 
Jones 6-12 potential* 



[A] 3 [A]i/a <3) [Dh = 


T* 

/„ <3) 

A ,3) 

f D m 

0.30 

1.0014 

1.0022 

1.0001 

0.50 

1.0002 

1.0003 

1.0000 

0.75 

1.0000 

1.0000 

1.0000 

1.00 

1.0000 

1.0001 

1.0000 

1.25 

1.0001 

1.0002 

1.0002 

1.5 

1.0004 

1.0006 

1.0006 

2.0 

1.0014 

1.0021 

1.0016 

2.5 

1.0025 

1.0038 

1.0026 

3.0 

1.0034 

1.0052 

1.0037 

4.0 

1.0049 

1.0076 

1.0050 

5.0 

1.0058 

1.0090 

1.0059 

10.0 

1.0075 

1.0116 

1.0076 

50.0 

1.0079 

1.0124 

1.0080 

100.0 

1.0080 

1.0125 

1.0080 

400.0 

1.0080 

1.0125 

1.0080 


* The superscript on the/'s indicates the number of terms used in equations such as Eqs. (19-13) and (19-14). 
Source: J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Cases and Liquids (New York: 
Wiley, 1954). 


Table 19-2. The types of collision integrals that appear in the dilute gas expressions for the thermal 
transport coefficients 


property 

1 st approximation 

2nd approximation 

V 

ft f2 - 2 > 

ft< 2 ’ 5 > 

3=2. 3, 4 

A 

ft< 2 - 2 > 

a=2. 3. 4 

D 

ft 11 - 1 ’ 

CP’ 2 ), 

s= 1, 2. 3 

K 

— 

n (2 ' t w i * s) n (3 ' 3) 

3 = 1. 2. 3, 4, 5 
t=2. 3, 4 


Table 19-2 gives the various types of collision integrals that appear in the first two 
approximations for A, t], D, and k r . Table 19-3 gives the fraction of the true value of 
the hard-sphere transport coefficients in the fcth approximation. It can readily be seen 
that the thermal diffusion ratio is the most demanding quantity to calculate. A look at 
Hirschfelder, Curtiss, and Bird’s Appendix 8A or Appendix I of an excellent summary 


* See also E. A. Mason, J. Client. Phys ., 22, p. 169, 1954. 
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Table 19-3. Fraction of the true value of the transport coefficients in the Arth approximation* 


p 

iPhim 

[puiP] 

[f]3/[P] 

V 

0.984 

0.999 

0.999+ 

A 

0.976 

0.998 

0.999+ 

D 

0.883 

0.957 

0.978 

k t 

0.77 

0.88 

— 


* Based on the rigid sphere model. [ P k ] /cth approximation to the coefficient P; [P] = true value of the 

coefficient P. 


Source: J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Cases and Liquids (New York: 
Wiley, 1954). 


by Mason* will show that the formula for k T can become quite long. You will also 
discover there another set of formulas for the etc., due to Kihara, which seem 

to be more convenient than those obtained from the Chapman-Enskog procedure. 
They are all equal to unity to within just a few percent, however. 

Equations (19-31) to (19-33), then, can be used to calculate 2, r\ y and Das a function 
of the intermolecular potential. All we need now are tables of the We discuss 
these Q ilf s) in the next section. 


19-3 TRANSPORT COEFFICIENTS FOR VARIOUS 
INTERMOLECULAR POTENTIALS 


It is possible to evaluate the fl (Ls) analytically for the hard-sphere potential, but not 
for the other potential functions we shall discuss. We saw in Chapter 16 that the 
deflection angle x(b y g) is 


x(g, b)= 2 arccos 



b <g 


= 0 


b>o 


for hard spheres of diameter o. Substitution of this result into Eqs. (19-20) and (19-19) 
gives Eqs. (19-22) and (19-23), which in turn give Eqs. (19-24) to (19-26) for 2, rj, 
and D. We have already discussed these rigid sphere results in Chapter 16. 


Table 19-4. Square-well parameters obtained from second virial coefficient data and viscosity data 


gas 

from B 2 (T) 


from rj(T) 


m°K) 

a(A) 

A 

elk(°K) 

cr(A) 

A 

Ar 

93.3 

3.07 

1.70 

167 

2.98 

1.96 

n 2 

95.2 

3.28 

1.58 

80 

3.36 

2.08 

co 2 

284 

3.57 

1.44 

200 

3.46 

2.22 

methane 

142 

3.35 

1.60 

244 

3.81 

2.29 


Source: The second virial coefficient values are taken from Sherwood and Prausnitz, J. Chem. Phys. 9 41, 
p. 429, 1964, and the viscosity values are taken from J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, 
Molecular Theory of Cases and Liquids (New York: Wiley, 1954). 


Contrary to the second virial coefficient, the collision integrals must be evaluated 
numerically for the square-well potential. Table 19-4 gives a comparison of the square- 
well potential parameters obtained from the second virial coefficient and the viscosity. 

Of course, both of these theories are essentially exact, and so the discrepancy is due 
to the inadequacy of the square-well potential. 


E. A. Mason,/. Chem . Phys., 22, p. 169, 1954. 
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Table 19-5. The integrals for calculating the transport coefficients for the Lennard- Jones 

6-12 potential 


T* 




n< 2 - 4 >* 

0.60 

1.877 

2.065 


1.610 

0.80 

1.612 

1.780 

1.549 

1.389 

1.00 

1.439 

1.587 

1.388 

1.258 

1.20 

1.320 

1.452 

1.280 

1.174 

1.40 

1.233 

1.353 

1.205 

1.115 

1.60 

1.167 

1.279 

1.149 

1.072 

1.80 

1.116 

1.221 

1.106 

1.038 

2.00 

1.075 

1.175 

1.073 

1.012 

2.20 

1.041 

1.138 

1.045 

0.9895 

2.40 

1.012 

1.107 

1.022 

0.9710 

2.60 

0.9878 

1.081 

1.002 

0.9555 

2.80 

0.9672 

1.058 

0.9935 

0.9485 

3.00 

0.9490 

1.039 

0.9708 

0.9295 

3.20 

0.9328 

1.022 

0.9578 

0.9185 

3.60 

0.9058 

0.9932 

0.9358 

0.8995 

4.00 

0.8836 

0.9700 

0.9175 

0.8840 

5.00 

0.8422 

0.9269 

0.8823 

0.8530 

6.00 

0.8124 

0.8963 

0.8565 

0.8295 

8.00 

0.7712 

0.8538 

0.8193 

0.7945 

10.00 

0.7424 

0.8242 

0.7923 

0.7690 

20.00 

0.6640 

0.7432 

0.7160 

0.6950 

40.00 

0.5960 

0.6718 

0.6475 

0.6285 

50.00 

0.5756 

0.6504 

0.6268 

0.6085 


Source: J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Cases and Liquids (New York: 
Wiley, 1954). 


Hirschfelder, Curtiss, and Bird discuss the transport coefficients of a Lennard-Jones 
6-12 gas in great detail. Table 19-5 gives values of several as a function of the 
reduced temperature. Table 19-6 lists the Lennard-Jones 6-12 parameters obtained 
by fitting viscosity versus temperature data and second virial coefficient versus tem- 
perature data. Tables 19-5 and 19—6 allow one to calculate the viscosity at any tem- 
perature (Problem 19-20). Figure 19-1 shows the viscosity for a number of substances 
versus temperature plotted in reduced units. One can see from Eq. (19-32) that 
(Problem 19-28) 






(19-36) 


is a reduced viscosity. Similar agreement is found for the thermal conductivity. 
Figure 19-2 gives the reduced coefficient of self-diffusion versus reduced temperature. 


Table 19-6. Lennard-Jones 6-12 parameters determined for second virial coefficient and viscosity 
measurements 


second virial 

coefficient viscosily 


gas 

elk(°K) 

o(A) 

e/*C K) 

o(A) 

Ar 

117.7“ 

3.504“ 

124 b 

3.418 b 

Kr 

164.0“ 

3.827“ 

190*" 

3.61 b 

Xe 

222.3“ 

4.099“ 

229 b 

4.055 b 

n 2 

95.2“ 

3.745“ 

91. 5 b 

3.681 b 

co 2 

198.2“ 

4.328“ 

190 b 

3.996 b 

ch 4 

148.9“ 

3.783“ 

137 b 

3.822 b 

Source: 

“A. E. Sherwood and J. M. Prausnitz, J. Chem. Phys. t 41, p. 429, 1964; b J. O. Hirschfelder, C. F. 


Curtiss, and R. B. Bird, Molecular Theory of Cases and Liquids (New York: Wiley, 1954). 
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Figure 19-1. Comparison of the theoretical curve of log (r}*l(T*) 112 ) with experimental data. The ex- 
perimental values are reduced according to the relation rj* = r)o 2 ftmey j2 . (From J. O. 
Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Gases and Liquids. 
New York: Wiley, 1954.) 


The reduced coefficient of self-diffusion is obtained from Eq. (19-33) (Problem 19-30): 


D* 


Dm 112 

o ^ 2 


(1 9-37) 


Again it can be seen that the agreement is very good. 

The thermal diffusion ratio is much more complicated than the three transport co- 
efficients we have discussed above. It seems to be the most sensitive to the form of the 
intermolecular potential. Hanley and Klein* discuss the determination of the inter- 
molecular potential from the thermal diffusion ratio in a mixture of isotopes. 

It can be seen from Figs. 19-1 and 19-2 that the Lennard- Jones 6-12 potential, with 
its two parameters e and <r, can be used to fit the thermal transport data of dilute gases 
quite well. The agreement is comparable to that found in Chapter 12, where we dis- 
cussed a similar comparison for the second virial coefficient. A more detailed numerical 
study, however, would show that the Lennard- Jones 6-12 potential with fixed values 
of e and o is not able to yield satisfactory agreement over an extended temperature 
range. In addition. Table 19-6 shows that the Lennard-Jones parameters obtained by 
fitting second virial coefficient data do not agree with those obtained by fitting vis- 
cosity data. Of course, since the theoretical formulas expressing these quantities in 
terms of the intermolecular potential are essentially exact, this indicates that the 
Lennard-Jones 6-12 potential is inadequate. 

A great deal of work has been done to devise a semiempirical intermolecular po- 
tential that is flexible enough to fit a number of experimental quantities simultaneously. 
These potentials usually involve more than two parameters and can become fairly 
complicated. The review articles by Fitts and Curtiss given in the “Additional Read- 
ing” discuss a number of these potentials and list references to numerical tabulations. 
Figure 19-3 shows the coefficient of viscosity for argon versus temperature for several 
widely different forms of the intermolecular potential. One can see from this figure 
that although rj(T) is somewhat insensitive to u(r), there are, indeed, discrepancies. 


* H. J. M. Hanley and M. Klein, J. C/tent. Phys. t 50, p. 4765, 1969. 
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Figure 19-2. The coefficient of self-diffhsion in reduced units. D* = (Djo)V mfe. The solid cnrve is the 
calculated curve: D*p*T*~ z,z = (3/8V7r)(l/f} (1 * 1,# ). The plotted points are the ex- 
perimental values in reduced units. (From J. O. Hirschfelder, C. F. Curtiss, and R. B. 
Bird, Molecular Theory of Gases and Liquids. New York: Wiley, 1954.) 


These are clearly illustrated by means of a so-called deviation plot, in which a difference 
such as ( rj cxp — tj ca ic)A/exp IS plotted versus temperature. Figure 19-4 shows such a 
deviation plot for argon for the Lennard- Jones 6-12 potential, the exp-6 potential and 
the Kihara potential. The exp-6 potential is given by 




(19-38) 


where r min is the value of r for which u is a minimum, e is the depth of the potential, 
and a is a parameter that governs the steepness of the repulsive part of the potential. 
This potential is discussed in Hirschfelder, Curtiss, and Bird, and the collision 
integrals have been tabulated by Mason.* 

The Kihara potential is given by 




(19-39) 


In this potential the finite size of the molecule is taken into consideration by including 
a core parameter a. The collision integrals for this potential have been tabulated by 
Barker, Fock, and Smith. f Figure 19-4 clearly shows marked deviations at tempera- 
tures below 200°K. 


* E. A. Mason, J. Chem. Phys. t 22, p. 169, 1954. 
t J. A. Barker, W. Fock, and F. Smith, Phys. Fluids , 7, p. 897, 1964. 
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Figure 19-3. The coefficient of viscosity of argon as calculated for several molecular models. (From J. O. 

Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Gases and Liquids y 
New York: Wiley, 1954.) 


There has been much discussion in the literature concerning the relative merits of 
various intermolecular potentials. A simple potential that seems to correlate many 
experimental data over a large temperature range is the aw-6-8 potential, which has 
the form 


M (r) = 


A 

r 


„8 


(19-40) 


In terms of the usual parameters of e, the depth of the well, and a 9 the value of r at 
which the potential equals zero, u(r) becomes 


u(r) 


(6 + 2y) 
n — 6 



[n - y(n - 8)] 



e 


n — 6 


(1 9-41) 
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Figure 19-4. Percentage deviation plots {0? Mp — r) ctlc )/r) clLP } x 100, where 7j exp are experimental values 
of the viscosity of argon and the ^ c .ic are calculated for the Lennard-Jones 6-12 potential, 
the exp-6 potential, and the Kihara potential. (From H. J. M. Hanley, /. Chem. Phys., 44, 
p. 4219, 1966.) 


where d = r mi Ja and y = c 8 /e(r min ) 8 . Using this potential, Hanley and Klein* have 
performed extensive calculations and comparisons to experimental data. Figure 19-5 
shows deviation plots for the second virial coefficient and viscosity coefficients of 
argon. According to Hanley and Klein, the m- 6-8 potential is the most successful 
few-parameter potential to correlate both second virial coefficient and transport data. 
The m- 6-8 parameters have not been determined for many substances yet, but Table 
19-7 gives them for argon, krypton, and xenon. In addition. Appendix E gives the 
reduced second virial coefficient, several reduced collision integrals, and the Chapman- 
Enskog correction factors/ for the 1 1-6-8 potential with y = 3. Ely and Hanley t have 


* H. J. M. Hanley and M. Klein, /. Phys. Chem., 76, p. 1743, 1972. 
t J- F. Ely and H. J. M. Hanley, Mol. Phys., 24, p. 683, 1972. 
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Figure 19-5. Deviation plots of the viscosity and second virial coefficient of argon. The calculated values 
of B 2 and tj are determined using the m- 6-8 potential with #n = ll, y— 3, a- 
3.292 A, and elk = 153°K). (From H. J. M. Hanley and M. Klein, J. Phys. Chem., 76, 
p. 1743, 1972.) 


Table 19-7. The m-6-8 potential parameters for argon, krypton, and xenon 


gas 

m 

y 

ct(A) 

e/kC K) 

Ar 

11 

3 

3.292 

153 

Kr 

11 

3 

3.509 

216 

Xe 

11 

3 

3.841 

295 


Source: H. J. M. Hanley and M. Klein, J. Phys . Chem., 76, p. 1743, 1972. 


also applied the m-6~ 8 potential to the nonpolar polyatomic gases 0 2 , N 2 , and C0 2 . 
It is possible that this* potential will emerge as the most useful potential for a serious 
calculation of the thermodynamic or transport properties of dilute gases, but this area 
is still a state of flux and discussion as an interesting recent paper by Hanley, Barker, 
Parson, Lee, and Klein shows.* 


* H. J. M. Hanley, J. A. Barker, J. M. Parson, Y. T. Lee, and M. Klein, Mol. Phys., 24, p. 11, 1972. 
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19-4 EXTENSIONS OF THE BOLTZMANN EQUATION 

Although the theory of transport phenomena in dilute gases made up of structureless 
spherical particles is well developed, all of the obvious extensions are still areas of 
research. For instance, there are still difficulties in trying to predict the transport 
properties of mixtures of monatomic gases. The equations rapidly become long and 
unwieldy, as one can see from Section 8.2 of Hirschfelder, Curtiss, and Bird. The 
difficulties in the case of polyatomic gases are even worse, however, since in this case 
there was still some question concerning the formulation of the Boltzmann equation 
itself until only recently. Furthermore, polyatomic molecules undergo inelastic as well 
as elastic collisions, and so the transport in polyatomic gases gets very involved with 
molecular scattering theory. We refer to the article by Curtiss in “Additional Reading ” 
for a discussion of the basic formulation of this theory and to papers by Mason and 
co-workers* * * § for practical applications. 

The 1960s saw a great deal of research in trying to extend the Boltzmann equation 
to higher densities. We learned in Chapter 12 that one can expand equilibrium thermo- 
dynamic properties in a power series in the density, and it is natural to try to expand 
the transport coefficients in a similar way. One approach has been to derive a general- 
ized Boltzmann equation from the BBGKY hierarchy by approximating the pair 
distribution function in the integral as a functional of the singlet distribution function. 
This was first suggested by Bogoliubov in 1946 and popularized by Uhlenbeck in this 
country. This development is very complicated, but expressions have been derived for 
the density corrections to the transport coefficients. An interesting result of all this is 
that terms in p 2 and higher do not exist. When the formal expressions derived for 
them are evaluated, they diverge. It appears now that logarithmic terms occur after 
the linear term. It has been found that a p 2 In p term seems to occur in the expansion 
of the transport coefficients. If this is correct, then the transport coefficients are not 
analytic functions of the density, and it is therefore not surprising that formulas de- 
rived on this assumption give divergent results. The Bogoliubov theory is too involved 
to go into here, but is discussed in Chapter 6 of Mazo and in the review article by 
Ernst, Haines, and Dorfman.t Hanley, McCarty, and Sengers have presented a de- 
tailed analysis of experimental data of transport coefficients as a function of density.! 
The review article by Curtiss discusses several other approaches as well as the 
Bogoliubov theory. 

There is an interesting and useful modification of the Boltzmann equation so that it 
yields an approximate theory of transport in dense hard-sphere fluids. This extension 
is due originally to Enskog§ and is now called the Enskog theory. The manipulations 
involved in deriving the final (simple!) results of the Enskog theory are fairly lengthy, 
and so we shall present only the end results here. The two references§ below, however, 
have excellent discussions on the Enskog theory. 

The Boltzmann equation considers only binary collisions and so ordinarily is not 
applicable to dense systems, but in the case of hard spheres the collisions are instan- 
taneous, and so the probability of multiple encounters is negligible. Enskog used this 
fact to graft a dense hard-sphere transport theory onto the Boltzmann equation. 


* L. Monchick and E. A. Mason, J. Client. Phys., 35, p. 1676, 1961; E. A. Mason and L. Monchick, ibid., 
36, p. 1622, 1962; L. Monchick, K. S. Yun, and E. A. Mason, ibid., 39, p. 654, 1963. 

f M. H. Ernst, L. K. Haines, and J. R. Dorfman, Rev. Mod. Phys ., 41, p. 296, 1969. 

% H. J. M. Hanley, R. D. McCarty, and J. V. Sengers, J. Chem. Phys., 50, p. 857, 1969. 

§ Section 9.3 of Hirschfelder, Curtiss, and Bird (“Additional Reading") and Chapter 16 of Chapman and 
Cowling (“Additional Reading”) are excellent references. 
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In the derivation of the Boltzmann equation, two assumptions that are made are 
that only binary collisions are important and that the molecules have zero size or, 
more exactly, that the molecular diameter o is small compared to the mean free path 
of the gas. Both of these assumptions are sensible in a dilute gas but not in a dense 
system. As the density of a dilute gas is increased, two effects become important be- 
cause the molecules how have a nonzero size. The first is a positional correlation, which 
will appear through the radial distribution function. Since g{r) is greater than unity 
when the molecules are near each other, there is an increase in the rate of collisions. 
Secondly, collisional transfer of flux becomes important. In a dilute gas the only 
mechanism for the transport of flux is the movement of the molecule itself through a 
plane. In a dense system, on the other hand, it is possible for a molecule on one side 
of a plane to collide with another molecule on the other side of a plane and transfer 
some momentum or energy across the plane even though neither molecule itself crosses 
through- the plane. This manner of transfer of flux is called collisional transfer. Of 
course, this mechanism is present in dilute gases also, but it does not become important 
until the density becomes large enough. In liquids, in fact, collisional transfer of flux 
is more important than molecular transfer. Two main factors, then, distinguish the 
Enskog theory from the Boltzmann equation; the frequency of collisions in a dense 
system is greater than in a dilute gas, and collisional transfer of flux is important. 

Recognizing these modifications, one derives an equation similar to the Boltzmann 
equation, called the Enskog equation, which is then solved by the Chapman-Enskog 
method. We shall simply summarize the results of the Enskog theory. Following 
Hirschfelder, Curtiss, and Bird, we introduce a parameter Y = b 0 pg(&) = (p/pkT) — 1. 
In terms of Y, we have 


n =1 + 0.800 + 0.761 Y 


t]°b 0 p Y 

K 


*l°boP 


= 1.002 y 


A 1 

= - +1.20 + 0.755 Y 


l°b 0 P Y 
D 1 


D°b 0 p Y 


(19-42) 

(19-43) 

(19-44) 

(19-45) 


The molecular dynamics rigid sphere transport coefficients have recently been cal- 
culated by Alder, Gass, and Wain wright.* Their results are given in Table 19-8. They 
identify the first terms in Eqs. (19-42) and (19-44) with the kinetic contribution, the 
third terms with the potential contribution, and the middle terms with a cross term. 
These are all presented separately in Table 19-8. One can see from there that the 
agreement with “ experiment” is quite good except for densities approaching the solid 
phase (i v/v 0 « 1.5). (See Fig. 13-2.) Alder, Gass, and Wainwright give a thorough dis- 
cussion of the results of the Enskog theory. 

The Enskog theory has been extended in a number of ways. For example, Thome-)- 
has extended the Enskog theory to binary hard-sphere mixtures, and Tham and 


* B. J. Alder, D. M. Gass, and T. E. Wainwright, /. Chem. Phys. t 53, p. 3813, 1970. 
f S. Chapman and T. G. Cowling, The Mathematical Theory of Non-Uniform Cases (Cambridge: Cambridge 
University Press, 1939); L. Barajas, L. S. Garcia-Colin, and E. Pina,/. Stat. Phys. 9 7, p. 161, 1973. 
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Gubbins* have published the multicomponent theory. In addition, Gasst has applied 
the Enskog theory to a fluid of rigid disks (i.e., a two-dimensional fluid), and Davis, 
Rice, and Sengersf have extended it to a square-well fluid. 

In spite of the fact that the Enskog theory is strictly a rigid sphere theory, Enskog 
showed how the results could be applied in an ad hoc manner to real systems. He 
suggested that instead of relating Y to the actual pressure of the system, i.e.. 


r -£r-' <’ M6) 

that one should introduce the so-called thermal pressure, T(dp/dT) v . The justification 
for this is that the pressure experienced by a single molecule is made up to two parts: 
the external pressure p due to the walls of the container and the “ internal pressure ” 
(5E/dV) T9 which represents the force of cohesion of the molecules. The sum of these 
is related to the thermal pressure by the thermodynamic relation 


(a— a 

Therefore, we write for Y : 



(19-47) 


(1 9—48) 


Furthermore, since Yjp 


dB 

bo = B(T) + T— 


b 0 as p -* 0, we must have 


where B(T) is the second virial coefficient. 

Equations (19-47) and (19-48), along with Eqs. (19-42) through (19-45), and the 
dilute gas expressions for rj ° , A 0 , and D° given in Section 19-2 give us a complete 
approximate modification of the Enskog rigid sphere theory. One can calculate the 
transport coefficients of a dense real system from equation-of-state data alone. Tables 
19-9 and 19-10 give an illustration of Enskog’s semiempirical extension of his rigid 


Table 19-9. The viscosity of nitrogen at 50° C as a function of pressure according to the modified 
Enskog theory 


p (atm) 

(g/cm 3 ) 

(cm 2 /sec) 

7} X 10 7 
(g-cnr* 

•sec -1 ) 

experimental 

calculated 

15.37 

0.01623 

117,900 

1,913 

1,810 

57.60 

0.06049 

32,740 

1,981 

1,900 

104.5 

0.1083 

19,280 

2,088 

2,050 

212.4 

0.2067 

11,480 

2,373 

2,240 

320.4 

0.2875 

9,520 

2,737 

2,660 

430.2 

0.3528 

8,370 

3,129 

3,080 

541.7 

0.4053 

8,660 

3,509 

3,480 

630.4 

0.4409 

8,590 

3,786 

3,800 

742.1 

0.4786 

8,700 

4,163 

4,180 

854.1 

0.5117 

8,890 

4,550 

4,550 

965.8 

0.5404 

9,090 

4,913 

4,920 


Source: J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Cases and Liquids (New York: 
Wiley, 1954). 


* M. K. Tham and K. E. Gubbins, /. Chem . Phys. y 55, p. 268, 1971. 
t D. M. Gass, /. Chem . Phys. y 54, p. 1898, 1971. 

% H. T. Davis, S. A. Rice, and J. V. Sengers ,/. Chem . Phys ., 35, p. 2210, 1971. See also J. B. Schrodt, J. S. 
Ku, and K. D. Luks, /. Chem. Phys. y 57, p. 4589, 1972. 
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Table 19-10. The viscosity of carbon dioxide at 40.3"C as a function of pressure according to the modified 
Enskog theory 


p (atm) 

(g/cm 3 ) 

(cm 2 /sec) 

rj X 10 7 
(g • cm -1 

experimental 

• sec -1 ) 

calculated 

45.3 

0.100 

18,000 

1,800 

1,910 

64.3 

0.170 

11,500 



75.9 

0.240 

9,080 

2,180 

2,160 

82.7 

0.310 

7,840 

2,430 

2,400 

86.8 

0.380 

7,240 

2,750 

2,730 

89.2 

0.450 

7,020 

3,160 

3,150 

91.7 

0.520 

7,040 

3,660 

3,670 

94.9 

0.590 

7,220 

4,260 

4,270 

101.6 

0.660 

7,560 

4,990 

4,980 

114.6 

0.730 

7,950 

5,800 

5,780 


Source: J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Cases and Liquids (New York: 
Wiley, 1954). 


sphere theory. One can see from the results that in spite of the approximate nature of 
the theory, the agreement with experiment is quite good when one compares the density 
variation of the transport coefficients at a fixed temperature. 

Hanley, McCarty, and Cohen* have made a thorough comparison of the modified 
Enskog theory to experimental data. Specifically, experimental data and theoretical 
predictions for the first density corrections to the viscosity and thermal conductivity 
were examined, and the temperature and density dependences of the experimental and 
theoretical transport coefficients in the liquid were studied. Overall* the modified 
Enskog theory, with the exception of the critical region for the thermal conductivity, 
was found to give reasonable agreement with experiment (to within about 10 to 15 
percent) for densities generally not exceeding twice the critical density. This reference 
is an excellent discussion of the modified Enskog theory and experimental transport 
data. 

Ely and McQuarriet have combined the modified Enskog theory with one of the 
statistical mechanical perturbation theory equations of state, thus eliminating the need 
for experimental p-V-T data. They arbitrarily chose the Barker- Henderson equation 
of state, which, under the macroscopic compressibility approximation, reads [Eq. 
( 14 - 32 )] 

^ !. ^ rMry dr ~ m) „ S. sM r) " V) ' J * 

where A is the Helmholtz free energy, r ) is the radial distribution function of a 

system of hard spheres of diameter d,tj = npd 3 /6, 

d(T ) = - \\e~^ T - 1 ] dr 
J o 

For the hard-sphere contribution to A, they use the Camahan-Starling equation of 
state: 


Z 0 = 


(1 + 1 ] + 1] 2 - 1 ] 3 ) 

(i - nf 


* H. J. M. Hanley, R. D. McCarty, and E. G. D. Cohen, Physica , 60, p. 322, 1972. 
f J. F. Ely and D. A. McQuarrie, /. Client. Phys. 9 60, p. 4105, 1974. 
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Figure 19-6. The viscosity of argon versus temperature for a number of densities. (From J. F. Ely and 
D. A. McQuarrie, / Chem . Phys ., 60, 4105, 1974.) 


It is then straightforward to use this equation of state to calculate Y from Eq. (19-48) 
and to use this value of Y to calculate the transport coefficients according to Eqs. 
(19-42) through (19-45). The results are shown in Fig. 19-6, where the viscosity is 
plotted versus temperature for argon at a number of densities. The results are quite 
good for temperatures above critical (150.86°K) and densities below approximately 
twice critical (p c & 0.52 g/cm 3 ). Similar results are found for the thermal conductivity. 

Regardless of the success of this approach, it is really an ad hoc theory and does not 
arise from a rigorous starting point. Consequently, it cannot be improved in an or- 
ganized procedure of successively better approximations. It is still desirable, therefore, 
to develop a molecular theory of transport in dense fluids which begins with the 
Liouville equation and reduces it to a calculable form by means of a series of well- 
defined approximations. We shall present such an attempt in Chapters 21 and 22. 
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PROBLEMS 

19-1. Substitute the Chapman-Enskog expansion 




into the Boltzmann equation and derive the equations for the f in . 
19-2. Prove that 


/ tol (r,v,0-p(M) 


" m 1 3,2 f 

2wkT(T, /)J exp ( 


/w[v— v 0 (r,f)] 2 j 
2kT(r,t) I 


19-3. Defining <f> through f m = j)/ 101 , show that it is determined by Eq. (19-4). 
19-4. Derive Eq. (19-5). 

19-5. Prove that is a linear integral operation. 

19-6. Show that 


ptoi = w Jw/to’rfv 
= pkT\ 

19-7. Show that 


q t0, = y J V x Vf tol dv = 0 


19-8. Prove that when the above expression for p (0) and q <0) are substituted into the 
equations of change [Eqs. (18-51), (18-52), and (17-31)], one gets the ide^l hydrodynamic 
equations. 

19-9. Derive Eqs. (19-9) and (19-10). 

19-10. By comparing q (0) + q (1) to Fourier’s law, show that 

2 k 2 T r 

A = - - A{W)W\W 2 - I)/* 01 dy 

3 mp J 

19-11. By comparing p <0) + p (1) to the Newtonian stress tensor, show that 


T)=- ^ J/t°ip(H 0 (WW - JW 2 I) : (WW - iW 2 \) d\ 


19-12. Prove that 


X(9> b) = 2 arccos^-j b<o 

= 0 b>o 

for hard spheres. 

19-13. Prove that 


n (W / kT\ 1 ' 2 (s + 1)! 

hard spheres \2TTfl) 2 U 


(I) 

hard spheres 


G <0 


hard spheres 




7TU 


and 



19-14. Derive Eqs. (19-24) through (19-26). 
19-15. Show that 
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is a function only of the reduced temperature T*. 

19-16. Derive Eqs. (19-31) through (19-33). 

19-17. Given the equation 

x 75k 2 TT 1 a 12 2 /*n 

A = — — — + 1 + 

8 m \a xl flnfl22-fli2 2 

Show that the second term here is much smaller than the first term. Use the 6-12 potential 
and several reduced temperatures, e.g., T* — 0.50, 1.00, 2.00, and 4.00. 

19-18. Given the equation 

_ 5kT \ 1 ..1 

^ 2 [bn bubiz — b 12 2 J 

Show that the second term here is much smaller than the first term. Use the 6-12 potential 
and several reduced temperatures, e.g., T* = 0.05, 1.00, 2.00, and 4.00. 

19-19. Hirschfelder, Curtiss, and Bird (“Additional Reading”) give the formula [their 
Eq. (8.2-18)] 



V x 


10 7 = 266.93 


(Mr ) 1/2 

a *nW(T*) 


where 

Tj = viscosity in poises 
T = temperature in °K 

* kT 
T* = — 
e 

M= molecular weight 
a = collision diameter in A 
Verify this result 

19-20. Use the formula of the previous problem to calculate the viscosity of argon from 
200°K to 500°K. Compare your results to experimental data (cf. Hirschfelder, Curtiss, and 
Bird, in “Additional Reading,” Table 8.4-2). 

19-21. Hirschfelder, Curtiss, and Bird (“Additional Reading”) give the formula [their 
Eq. (8.2-31)] 


A x 


10 7 = 1989.1 


(TIM) 112 

<7 2 n< 2 - 2 >*(r*) 


where 

A = thermal conductivity in cal/cm • sec • °K 
M = molecular weight 
Verify this formula. 

19-22. Use the formula of the preceding problem to calculate the thermal conductivity of 
argon from 100 K to 600 K. Compare your results to experimental data (cf. Hirschefelder, 
Curtiss, and Bird in “Additional Reading,” Table 8.4-9). 
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19-23. Hirschfelder, Curtiss, and Bird (“Additional Reading”) give the formula [their 
Eq. (8.2-46]). 


£> = 0.002628 


(T 3 /M) l/2 


where 

D = coefficient of self-diffusion in cm 2 /sec 
p = pressure in atmospheres 
M = molecular weight 
Verify this formula. 

19-24. Use the formula in the preceding problem to calculate the coefficient of self- 
diffusion of argon at 1 atm and for 100°K < T < 350 C K. Compare your results to experimen- 
tal data (cf. Hirschfelder, Curtiss, and Bird in “Additional Reading”, Table 8.4-13). 

19-25. Show that the collision integrals obey the recursion formula 


n<,,s+1) = t ~vT + t)n<M) 


19-26. Show that the reduced collision integrals, = n (/ - s) /n H s (/,s) obey the recursion 
formula 


i 

+ n* = £}(/.*)• _L T* 

s + 2 dT* 

19-27. Hattikudur and Thodos* give approximate, simple formulas for £l <l , l) (T*) and 
0 ( 2 . 2 , ( 7 *) f or the 6-12 potential. Show that these formulas do, indeed, reproduce the values 
in Table 19-5. Now using the relation in Problem 19-25, derive approximate formulas for 
n (1 - 2) and H (2 - 3) and compare these to Tables in Hirschfelder, Curtiss, and Bird for T* = 0.3, 
1.00, 2.00, 10.00, and 100.0. 

19-28. Prove that the reduced viscosity is given by 

7]<T 2 

= — — - 
1 (mB ) l/2 

19-29. Prove that the reduced thermal conductivity is given by 

Xu 2 m lt2 
* ~ £ 1/2 


19-30. Prove that the reduced self-diffusion coefficient is given by 


D* 


Dm 112 

OE l/2 


19-31. Using the /n-6-8 parameters given in Table 19-7 and the collision integrals given 
in Appendix E, calculate the viscosity and thermal conductivity of argon from 100°K to 
1000°K. Compare your results to those of the Lennard-Jones 6-12 potential, using the 
parameters given in Table 19-6. 

Calculate the second virial coefficient from both of these potentials and compare. (Use the 
viscosity Lennard-Jones parameters in Table 19-6.) 

19-32. Calculate the viscosity, thermal conductivity, and self-diffusion coefficient of 
argon at STP and compare your results to experimental data. (See Hirschfelder, Curtiss, and 
Bird in “Additional Reading,” pp. 561, 575, and 581.) 


♦/. Chem. Phys. t 52, p. 4313, 1970. 
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19-33. Calculate the viscosity of methane at 300°K and compare your results to experi- 
mental data. 

19-34. Calculate the viscosity of carbon dioxide at 500°K and compare your result to 
experimental data. 

19-35. Calculate the thermal conductivity of krypton at 0°C and compare your result to 
experimental data. 

19-36. Calculate the coefficient of self-diffusion of argon at 0°C and 1 atm and compare 
your result to the experimental value of 0.157 cm 2 /sec. 

19-37. Calculate the diffusion coefficient for the gas-pair nitrogen-oxygen at 0°C and 
1 atm. Compare your result to the experimental value of 0.181 cm 2 /sec. (For the mixed 
interaction parameters a 12 and £ 12 , use the approximate combining rule. 


<*12 = ifai + °l) 

£i2 = (£i£2 ) 1/2 

19-38. In Problem 16-19, we introduced the Eucken correction factor to the thermal 
conductivity. This factor accounts for the fact that a polyatomic molecule carries energy in 
its internal modes as well as in its translational modes. The thermal conductivity is modified 
according to 

. 15 R /4 C v 3\ 

“ T M V \l5 ~R + 5 ) 

Note that the factor in parentheses is unity for a monatomic gas. 

Use this expression to calculate the thermal conductivity of carbon dioxide and methane 
at 300°K and compare your results to experimental data (cf. Hirschfelder, Curtiss, and Bird 
in “Additional Reading,” Table 8.4-10.) 

19-39. Read Section 8.2b of Hirschfelder, Curtiss, and Bird (“Additional Reading”) and 
calculate the viscosity of a binary mixture of argon and neon, verifying the calculated results 
in their Table 8.4-5. In order to do this, one needs to know the mixed interaction parameters 
£ 12 and cr I2 . For these, assume that 


<712= K<7i + o 2 ) 


£i 2 = (£i £ 2 ) 1/2 


19-40. Using the collision integrals given in Table 19-5, calculate the quantities f x and /* , 
through the second term and compare your results to the three-term expression for f x 
and f n given in Table 19-1. 

19-41. Using the collision integrals in Table 19-5 and the extended formulas for f x and f n 
given either in Hirschfelder, Curtiss, and Bird (“Additional Reading”) or Appendix I of 
E. A. Mason,* verify the entries in Table 19-1. 

19-42. Verify the entries in Table 19-3 for the viscosity and thermal conductivity. Use the 
extended formulas for f x and /„ given in Appendix 8A of Hirschfelder, Curtiss, and Bird 
(“Additional Reading”) or Appendix 1 of E. A. Mason,* 

19-43. Kihara (cf. Hirschfelder, Curtiss, and Bird in “Additional Reading,” Appendix 
8A) has developed alternate expressions for the higher approximations to the transport 
coefficients. Unlike Chapman and Cowling, who express the transport coefficients in terms 
of ratios of infinite determinants which are then truncated, Kihara notes that the off-diagonal 


* J. Chem . Phys ., 22, p. 169, 1954. 
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elements in these determinants are small, and so he expands the infinite determinants in 
powers of the off-diagonal terms. For pure gases, Kihara obtains the results 


y*^(KIhsra) 


/A (KIh.r -,_ 1 + 

and 

y D (KIhara) = J + 

where 

A* 

and 


3 ptf* 2 
“ 1 + 49 [ 

2 r 4 a< 2 - 
21 [n<*- : 


3 )* 

2 )*~ 


3 )* 

*>*" 


(6 C* - 5) 2 
16 A* + 40 

n< 2 - 2) * 


c* = 


aw 


nd.D* 


g 

3 


2 


2 


Evaluate these correction factors and compare your results to Table 19-1. 

19-44. Joshi* has developed a procedure for evaluating the correction factors f, and /*, in 
which the actual intermolecular potential energy function is considered to be a perturbation 
over the rigid spherical one. Use Eqs. 6 and 7 of Joshi’s paper and calculate some of the 
entries in his Table 1. 

19-45. Show that the exp-6 potential has a spurious maximum (at r m „) and then ap- 
proaches — oo as #*-*0. How would you remedy this in doing calculations with this 
potential ?t 

19-46. Consider the integral equation 
| K(x,y)f(x)dx = h(y) 

where K{x, y) and h(y) are known functions and fix) is to be determined. Now expand both 
f(x) and h(y) in a set of functions that is orthonormal in the interval (a, b ): 

fix) = 

Ky)=J.hj<f>/y) 

J 

Since fix) is not known, the Cj are as yet undetermined, but the hj are given by 
hj = j h(y)<f>j(y) dy 

Multiply the integral equation by <f>jiy) and integrate; substitute the expansion for fix) into 
the result; and derive the infinite set of algebraic equations 

bj=J t c l a u 

j 

where 

b 

(h) = JJ K(x, y)4>ix)4>j{y) dx dy 


* Chem. Phys. Lett., 1, 575, 1968. 

t Compare E. A. Mason, J . Chem. Phys., 22, p. 169, 1954. 
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19-47. Repeat the previous problem, but this time assume that /(*) and h(y) can be 
represented by a finite number of terms 

/(*) = 2 cj 4 > j ( x ) 

J=o 

Ky)= 2 MjOO 

J = o 

Show how this leads to a finite set of algebraic equations. 



CHAPTER 20 


THEORY OF BROWNIAN 
MOTION 


The last few chapters have treated the transport theory of dilute gases. We have been 
able to derive essentially exact relations between the various transport coefficients and 
the intermolecular potential. A number of the more recent and successful theories of 
transport in dense fluids are based upon the ideas of Brownian motion, and so in this 
chapter we shall digress into the theory of Brownian motion. This is a rich and beauti- 
ful field in itself. It was developed many years ago by Einstein, Planck, Smoluchowski, 
and others, and much of this chapter is based on the classic review article by Chan- 
drasekhar (“Additional Reading”). 


20-1 THE LANGEVIN EQUATION 

In 1827, the botanist Robert Brown discovered that small pollen grains immersed 
in a fluid undergo a perpetual irregular motion. This irregular motion is now known 
to be due to incessant collisions of the pollen particle with the molecules of the sur- 
rounding fluid. To an observer, the trajectory of such a Brownian particle will appear 
as an irregular, random path. We assume that the force on such a particle can be split 
up into two parts. The first is a frictional force due to the drag exerted on the particle 
by the fluid. If u represents the velocity of the particle, then this force is assumed to be 
of the form given by Stokes’ law, i.e., —y'u where / is the friction constant. / is given 
by 6narj 9 where a is the radius of the particle and tj is the viscosity of the medium. The 
second part of the total force acting upon a Brownian particle is assumed to be the 
fluctuating force, A'(0* This random force represents the constant molecular bom- 
bardment exerted by the surrounding fluid. We furthermore assume that A '(t) is in- 
dependent of u and that A'(f) varies extremely rapidly compared to the variations in u. 
This second assumption says that time intervals At exist such even though u (t + At) 
— u(r) is expected to be very small, no correlation between A'(f) and A'(t + At) exists; 
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i.e., A'(0 will have undergone many fluctuations. The Langevin equation is simply the 
equation of motion for such a Brownian particle, namely, 

m -y- = —y'u + A'(0 
dt ' 

or more conventionally, 

■jj“ = — C u + A(t) (20-1) 

where C = ylm and A (t) = A'(t)/m. A differential equation such as Eq. (20-1), where 
one of the terms is a randomly varying function, is called a stochastic differential 
equation. 

We must now solve this equation. What we mean by solving a stochastic equation 
is to find the probability that the solution is u at the time t, given that u = u 0 at t = 0. 
This is given by the probability density function W( u, t \ u 0 ). Clearly we require of W 
that 


FF(u, t ; u 0 ) ->• b(u x - u x0 )5(u y - u y0 )b(u z - u z0 ) t -v 0 


( 20 - 2 ) 


Furthermore, we expect that as t -► oo, the particle must be in equilibrium at the tem- 
perature of the surrounding fluid, and so u, t; u 0 ) must approach a Maxwellian 
distribution, independently of u 0 , as t -* oo ; i.e.. 


W{ u, t ; u 0 ) 



(20-3) 


Equation (20-1) is a first-order linear differential equation and can be solved 
formally to give (Problem 20-1) 

U = u (t) — u 0 e~ Qt = e~ Qt f e K A(£) d£ (20-4) 

•'o 

This shows that the statistical properties of u — u 0 e~ Xtt must be the same as 


( e“A (Qd£ 

J o 

If we assume that the ensemble average of A(t) is zero, then taking the ensemble 
average of both sides of Eq. (20-4) gives 

<u>=u 0 c _Ct (20-5) 

We can also readily find <U 2 > by squaring both sides of Eq. (20-4) and then 
ensemble averaging both sides to get (Problem 20-2) 

<C/ 2 > = <m 2 > - u 0 2 e~ 2 ^ = <T 2Ct f f e c(, ' +, " ) <A(<') • A(f")> dt' dt" (20-6) 

•'o •'o 

Now <A (?) • A (t")y represents the correlation of A at time t f with A at time t”. Since 
A is a rapidly varying function, we assume that <A (/') • A (t")> is a function only of 
\t r — t" | and is nonzero only when | t r — t"\ is small; i.e., we assume that 

<a(/')-A(/")> = <M I *'-*"!) 


(20-7) 
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where (| t' — t"\) is very peaked at t' = t". We then let t' + t" = x and t' — t" = y 
in the integral in Eq. (20-6); we get (Problem 20-3) 

<C/ 2 > = ie~ 2( ‘ f e ix dx f 0,O>) dy 

J 0 J -oo 

The limits ± oo have been used since (j>i(y) is a very rapidly decreasing function. We 
can let the integral of $j(y) be a constant t to get 

< u2 >=^(l~ e ~ 2it ) ( 20 - 8 ) 

We can evaluate t by realizing that equipartition must apply when t -► oo. This gives 
3 kT t 


or 


VcT 

<t/ 2 >= (1 - e ~ 2Ct ) (20-10) 

m 

Using Eq. (20-5), we may write this equation in the form 



which shows how <w 2 > approaches its equipartition value. It is actually possible, 
although fairly tedious, to continue this process and calculate all the moments of U. 
The result is (Problem 20-4) 

<C/ 2n+1 > = 0 

<C/ 2 "> =1 3-5 - ■ • - (2n — 1)<U 2 > 


which shows that the probability distribution of U is Gaussian (Problem 20-5), or that 


f m 1 3/2 


m|u — u 0 e Cr | 2 " 
2kT(l -e" 2 *). 


( 20-1 2 ) 


It can be readily seen that this becomes a Maxwellian distribution independently of 
u 0 as t — ► oo. 

In the very same way we can go on to consider the probability distribution function 
of the displacement r instead of the velocity u. Since 


r-r 0 =fu(O dt' 

we have, according to Eq. (20-4), that 

r — r 0 = J* + e-«' J* dt" e^A(t ,r ) 

or 

r - r 0 - C" ‘u^l - e "*) = f ' df e~v ( dt" e^A (f) 

J o J o 



THE LANGEVIN EQUATION 455 


We can simplify the right-hand side of this equation by integrating by parts to give 
r-r 0 -C~ ‘u 0 (l - e~ p ) = f ‘[1 - dt' ( 20 - 1 3) 

J o 

By taking the ensemble average of both sides, we get (Problem 20-6) 

(1 - e~«) 


<r - r 0 > = u 0 


C 


By squaring and then averaging Eq. (20-13) in the same way that we did before 
(Problem 20-7) 


<|r - r 0 | 2 > = ^ (1 - + j^(2C< - 3 + 4e~« - e~™) 


(20-14) 


Just as for u — u 0 e it is possible to show that r - r 0 — ( ‘u 0 (l — e ?r ) is Gaussianly 
distributed, i.e., that (Problem 20-8) 


W ( r 


, t; to, u 0 ) = £ 


mC 2 


3/2 


2knT(Xt - 3 + 4e~ t ‘ t - e" 25 *). 


mC 2 |r-r 0 -C Vl-e ^l 2 ] 
. 2kT(2Ct - 3 + 4<T ?t - e ~ 2lt ) J 


(20-15) 


Actually, Eqs. (20-12) and (20-15) are special cases of the general theorem, proved 
in Chandrasekhar’s review article, which states that if 


r= r w)^ 

J o 


then the probability distribution of R is given by 


R) = 


l m Jo 


3/2 


exp 


R J 


4 CkT 


m 


( 20 - 1 6 ) 


(20-17) 


Equation (20-14) has two interesting limiting conditions. When t is very small, we 
have 

<|r-r 0 | 2 >- |w 0 2 |< 2 *-»0 (20-18) 


which simply says that 



for small t. On the other hand, for large times, we have 


l2 . 6kT _ ^ 
< l r — r ol > = ~~^ t = 6Dt 


( 20 - 1 9 ) 


where we have introduced the diffusion constant, D = kT/m^, This result was first 
derived by Einstein and later verified by Perrin in a famous series of experiments on 
Brownian motion. It has more recently been illustrated in molecular dynamics calcu- 
lations by Rahman* on a system of particles obeying a Lennard- Jones 6-12 potential. 
This is shown in Fig. 20-1. 


A. Rahman, Phys. Rev. y 136A, p. 405, 1964. 
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Time (1CT 12 sec) 

Figure 20-1 . A molecular dynamics calculation illustrating Eq. (20-19). Note now initially the mean- 
square displacement varies as / 2 , but then goes into a linear dependence as t becomes larger. 
(From A. Rahman, Phys. Rev., 136A, p. 405, 1964.) 


So we see that for very short times the mean-square displacement goes as f 2 , a result 
dictated by classical mechanics, but after a time long compared to C -1 , the statistical 
nature of the problem takes over, and the mean-square displacement goes linearly in 
time. Note the implications here concerning reversibility and irreversibility. 

Equation (20-15) also has an interesting limit for t > C _1 - In this limit we can ignore 
£ -1 u 0 (l — e -Cf ) compared to r — r 0 , which increases with increasing t , and get 
(Problem 20-10) 


W(r 9 f;r 0 ,u 0 ) 


1 . J l r ~ r o| 2 ) 

(47t£>0 3/2 XP i 4 Dt | 


t>r 


( 20 - 20 ) 


This is the well-known solution to the diffusion equation, which goes to 8 (t — r 0 ) as 
t —> 0 [cf. Eq. (17-53)]. In the next section we shall derive generalizations of the 
diffusion equation that give Eqs. (20-12) and (20-15) as their fundamental solutions. 
These “diffusion equations” will govern diffusion in velocity space and phase space 
itself rather than just diffusion in configuration space. 


20-2 THE FOKKER-PLANCK EQUATION AND 
THE CHANDRASEKHAR EQUATION 

We shall now show how to determine Eqs. (20-12) and (20-15) as solutions to 
appropriate boundary-value problems of certain partial differential equations. We are 
particularly interested in an equation governing W(r, u, t\ the density function in the 
phase space of a single particle since one of the more successful theories of transport 
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in dense fluids approximates the reduced Liouville equations for / (1) and / (2) by 
diffusion equations in the six- and twelve-dimensional phase space of one and two 
particles.* 

First, however, we shall derive a differential equation for just W(u, t). This differen- 
tial equation, the diffusion equation in velocity space, is known as the Fokker-Planck 
equation. 

Let At be a time interval long enough for A (t) to undergo many fluctuations but 
short enough so that u (t) does not change appreciably. As before, we assume that such 
a time interval exists. Under these circumstances, we should expect to derive W( u, 
t + At) from W(u, i) and a knowledge of the transition probability ¥( 11 ; Au), the 
probability that u undergoes a change Au in the interval A t. In particular, we can write 

W(u, t + At) = J W(u — Au, ty¥(u — Au; Au)d(Au) (20-21 ) 

Equation (20-21) is a fundamental equation in probability theory. It states that the 
course of the system at a time t depends only upon the instantaneous state of the 
system at the time t and is independent of its previous history. A probabilistic or 
stochastic process that has this characteristic is known as a Markov process . Equation 
(20-21) is the fundamental equation of Markov processes and is called the Chapman - 
Kolmogorov equation . 

We now expand W( u, t + At), W ( u - Au, t), and ^(u - Au; Au) in Taylor series. 
dW 

W(u, t) + — At + 0(At 2 ) 


t cW 1 d 2x ¥ d 2x ¥ 1 

*■> - ? v 4 “‘ + 2 ? v iu ' + • ■ • ■] 

x d(Aui)d(Au 2 )d(Au 3 ) (20-22) 

In Eq. (20-22), 0(At) 2 means terms such that 0(At 2 )/At — ► 0 as Af — > 0. Write 

<A u,y = f Am,- *F(u; Ad)d(Aa) 

^ — co 

<Am, 2 > = f Au 2 ^(u; Aa)d(Au) 

^ — co 

< Am ; Amj) = f” Au i Au J 'i’(u; Aa)d(Au) (20-23) 

Using these definitions then, Eq. (20-22) becomes 

^ A t + 0(At 2 ) = -Z <Am,> + <Am/> + Z <A«i Auy> 

dt j duj 2 j au/ Kjduiduj 


KA uj> 


8W _ dW 3<Am, Aw,) 




* S. A. Rice, and P. Gray, The Statistical Mechanics of Simple Liquids (New York: Interscience, 1965); 
R. M. Mazo, Statistical Mechanical Theories of Transport Processes (Oxford, Pergamon, 1967). 
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where the remaining terms involve quantities <Aw y 3 >, <Aw, Aw y 2 >, and <Aw, Aw y Au k y 
and higher-order terms. Equation (20-24) can be written more conveniently in the 
form 

IT A ' + ^ + 5 ? 

+ I A ^» + • • • (20-25) 

i<j OUl OUj 

This is the Fokker-Planck equation in its most general, albeit unusual, form. We must 
now evaluate the quantities in Eqs. (20-23). Equation (20-5) gives us <Aw j >. This may 
be readily seen by writing Eq. (20-5) in the form 

<Au> = u(e~‘ At - 1) 

to give (Problem 20-12) 


<Aw y > = — C w y A* + • - • 

Similarly, Eq. (20-11) gives (Problem 20-12) 


<a u/y 


2tkT 


A t + 


m 


and if we assume that different components of A (t) are uncorrelated, we have 


<Am £ Aw y > = 0 i+j 

Equation (20-25) then can be written in the form 

At + 0(At 2 ) = [c div n (Wu) + V n 2 w\ At 4* 0(At 2 ) 
ot L m J 


which, in the limit Af->0, becomes the Fokker-Planck equation: 
dW IkT 

— = C div n (Wu) + - — V n 2 lF (20-26) 

ot m 

It can be readily shown by substitution that Eq. (20-12) is the fundamental solution 
to this equation, i.e., the solution that tends to 8(u — u 0 ) as t -► 0 (Problem 20-13). 
This is the diffusion equation in velocity space. We are now ready to derive the diffusion 
equation in phase space. 

Again let At be an interval of time long enough that A (t) undergoes many fluctua- 
tions but short enough that neither u nor r changes appreciably. Then we can write 

Ar = u At 

and 

Au = — (Cu — K) At + B(A t) (20-27) 

where K denotes an external force (per unit mass) and B(At) the net acceleration arising 
from fluctuations that occur in time At. Assuming that diffusion in phase space is a 
Markov process, we can write 

W( r, u, t + At) - ff W(t — At, u - Au, t)\J/(r — Ar, u — Au; Ar, Au)d(At)d(Au) 

(20-28) 
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Because of Eqs. (20-27) we write 

^(r, u; Ar, Au) = v P(r, u; Au)<5(Ar — u At) (20-29) 

where V(r, u; Au) is the transition probability in velocity space. With this form for 
ip(r, u; Ar, Au), we can integrate over Ar in Eq. (20-28) to get 

W(r, u, t) + At) = J W(r — u At, u — Au, t)*P(r — u At, u — Au; Au)d(Au) (20-30) 

or, equivalently, 

lT(r + u At, u, t + At) = J W(j, u — Au, Aty¥(r, u — Au; Au)J(Au) (20-31 ) 

We can now derive the phase space analog of Eq. (20-22) by expanding the various 
functions in Eq. (20-31) in Taylor series to obtain 

( /\ JTF % /\ ^ 

— + u • V r Wj At + 0(At 2 ) = -I ^ (W<Amj» + 2 £ a 72 (^<Au/» 

^ 3 2 

+ E 7 — (W^<Au; A uj» + ■■■ (20-32) 

i<j 0Ui CUj 

This is the Chandrasekhar equation in its general form. We can calculate <A Wj>, 
<A ufy, and <A w, Awy) from Eqs. (20-27). 

The second of Eqs. (20-27) shows immediately that 


<Auj>=-ttUj-Kj)At + --- 

To find <A ufy, we proceed as we did in the derivation of the Fokker-Planck equation 
and get (Problem 20-20) 

2tkT 


<AWy 2 > 


A t + 


m 


and 


< Aw £ Awy> = 0 i+j 

Therefore, Eq. (20-32) simplifies to in the limit At 0, 
dW tkT 

—J— + u-V r W + K = C div n (^u) + y u 2 W (20-33) 

ot m 


This equation is the generalization of the Fokker-Planck equation to phase space. 
It is often referred to as the Chandrasekhar equation. Notice how the left-hand side of 
this equation is identical with the left-hand side of the reduced Liouville equation for 
/ (1) [Eq. (18-14)]. The right-hand side of Eq. (20-33) is the Brownian motion repre- 
sentation of the collision terms in the Boltzmann equation. Both the Kirkwood and 
Rice-Allnatt theories* of transport in dense fluids derive Chandrasekhar-type equa- 
tions for/ (1) and / <2) or at least the parts of / (1) and / <2) due to the long-range part of 
the potential. They derive these equations directly from the reduced Liouville equations 
for/ (1) and / (2) . 

We have derived the diffusion equation in velocity space and the diffusion equation 
in phase space. Before concluding this chapter let us derive the diffusion equation in 


* S. A. Rice, and P. Gray, The Statistical Mechanics of Simple Liquids (New York: Interscience, 1965); 
R. M. Mazo, Statistical Mechanical Theories of Transport Processes (Oxford: Pergainon, 1967). 
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configuration space, the “ordinary” diffusion equation. Equation (20-20) says that 
for At > £ - \ the transition probability that r changes by Ar in the time interval At is 


Assuming then that the process is Markovian for At > £ _1 , we can write 
WP(r, t + At) = J W(r - Ar, t)V(Ar)d(Ar) 

Applying the same procedure we have applied twice before, we get 


(20-34) 


(20-35) 


(20-36) 


the ordinary diffusion equation. The fundamental solution of this equation is Eq. 
(20-20). (See Problem 20-15.) 
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PROBLEMS 

20-1. Show that a formal solution to the Langevin equation 

du 

-=-£« + A (0 

is 

u (0 - u 0 e‘ cr + f e«A(f) d£ 

J o 

20-2. Derive Eq. (20-6). 

20-3. Show that <t/ 2 > is given by 

<L/ 2 > = <w 2 > — ~ &e~ 2Cr f e Cx dx f (y)dy 

^ 0 ^ - oo 

where <A(rO • A(/")> = MU'- 1"\)- 
20-4. Prove that the moments of U = u - u 0 e _f ' are given by 
= 1-3-5 (2 n- 1 )<t/ J > 

< [/ 2. + l> o 
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In order to evaluate these higher moments, one must make assumptions about the higher- 
order correlation functions <A{ti)A{t 2 ) • • • A(tj)y. It is a common assumption to assume 
that the random process A(t) is Gaussian, which amounts to assuming that* 


<A(ti)A(t 2 ) • • * A(t 2 „ +1 )> = 0 

<A(t l )A(t 2 ) • • • A(t 2 *)> = 2 <A(t,)A(tj)> <A(t k )A(t,)> • • • 

all pairs 


20-5. Prove that the result for the moments of U given in the previous problem implies 
that 


JV(u, t\ u 0 ) 


-[ 


m 


2irkT{\ - e~ 2Ct ) 


' 3/2 r 
eXP 


m\n — u 0 ^~ Cf | 2 
2kT(\-e- 2it ) 


20-6. Show that 


<(r — r 0 )> = 


u 0 (l - e-t*) 

i 


20-7. Show that 

u 2 3kT 

< | r - r 0 1 2 > = (1 - e-“) 2 + ^5 <2t* - 3 + 4e - e~ 2 «) 

20-8. Prove that the distribution of r — r 0 — £ -1 u 0 (l — e~ ct ) is given by Eq. ( 20 - 15 ). 
20-9. Using the theorem given in Eqs. ( 20 - 16 ) and ( 20 - 17 ), derive Eqs. ( 20 - 12 ) and ( 20 - 15 ). 
20-10. Derive Eq. ( 20 - 20 ). 

20-11. Chandrasekhar (“Additional Reading”) proves the following lemma. Let 

R — fV(f)Atf)<£ 

J 0 

and 


Then the joint probability distribution of R and S is given by 

1 r (GR 2 - 2HR ■ S + FS 2 )1 

^ (R,S) 8tt 3 (FG - H 2 ) 3 ' 2 exp [“ 2 {FG-H 1 ) J 

where 

F 2q f ^)di 
J 0 

G~2q[ <j>X£)d£ 

" 0 

H- 2q\ WmS)# 

J 0 

where q = IJcT — y'kT/m. 


* See M. C. Wang and G. E. Uhlenbeck, Rev. Mod . Phys., 17, p. 323, 1945. 
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Letting 

R = r — r 0 — £ _1 u 0 (l — e~ v ) 

S = u — u 0 e -f ' 
show that 

, F(f) = i- l [l-« < ( <{ - 0 ] 

and that 

F= qi-\2it - 3 + 4e~« - e~ *') 

G = qi~\ l-e- 2 “) 

H = qi~ 3 (l-<r ‘ f ) 2 

20-12. Derive the equations 
<A u j> = - £uj Ar + • • • 

2tJcT 

<Aw/> = A *+•■• 

m 

Hint: Remember that A u 2 = (uj — u 0J ) 2 . 

20-13. Show that Eq. (20-12) is the fundamental solution of the Fokker-Planck equation. 

20-14. Derive Eq. (20-36) by starting with the Chapman-Kolmogorov equation and 
Eq. (20-34). 

20-15. Show that Eq. (20-20) is the fundamental solution to Eq. (20-36). 

20-16. Show that the distribution given in Problem 20-11 is the fundamental solution of 
the Chadrasekhar equation. 

20-17. Show that if we set u 0 2 = <«o 2 > in Eq. (20-14), then 
6kT 

<|r — r„| I > = ^(ir— 

20-18. Assuming that u 0 and A(£) are uncorrelated, show that the time correlation 
function of the velocity, defined as < u(f) ■ u(0) >, is given by 

<u(f) • u(0)> = < « 2 (0) >e- Cf 

3 kT 

— — e~ 
m 

We shall see in Chapter 21 that such time correlation functions play a central role in modem 
theories of transport. 

Noting from Eq. (20-19) that the self-diffusion coefficient can be expressed as D = kT/m £, 
show that 

j>-^J <u( / ) u(o)> ^ 

We shall derive this result in Chapter 21 . 

20-19. The relation between the coefficient of self-diffusion and the velocity time correla- 
tion function suggested in the previous problem can be derived from the considerations 
given in this chapter. Consider Eq. (20-19): 

D = lim i<|r(f)-r(0)| 2 > 

f — ♦ oo O * 
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Using the fact that 
r(/) = r(0)+ 

J o 

show that 

D = lim ^ f dt’ f dt" <u(0 • u(f")> 
r-» oo 6 1 J 0 J 0 

Using the stationarity of the equilibrium ensemble average and time reversibility, show that 

<u(r') • u(r")> - <u (/' - n • u(0)> = <u (/" - n * u(0)> 

Now change the integration variable tor = t" — t' and interchange orders of integration to 
get 


D = i lim J ( l “ 7 ) dt 

If the velocity time correlation function decays sufficiently rapidly, we can finally write 
= <u(0) • u(t)> dr 

We shall discuss such expressions in great detail in Chapter 21. 

20-20. In deriving the Fokker-Planck equation, an intermediate step was to prove that 

<Aw 7 > = — KJij At -\ and that <A «/> = (2 IkTjm) Ar H (Problem 20-12). An alternate 

derivation of this equation (cf. Chandrasekhar in “Additional Reading”) argues that the 
transition probability ^(u — Au; Aw) in Eq. (20-21) has the form 

*F(u; Au) = (47 rq A t)~ 3/2 exp 

where q — tJcT/m. Show that this leads to the same results as those given in the chapter. 

20-21. Equation (20-9) is a very interesting equation, whose significance should be pointed 
out. Show that this equation can be written as 


|Au + pu A/| 2 1 
4 q A t j 


where we have recognized the fact that the random process represented by A(r) is stationary. 

This is a fundamental result, whose significance we shall discuss fully in Chapter 21. 
Nevertheless, note here that if the Langevin equation represents a Brownian particle randomly 
moving about in thermal equilibrium, then the friction constant £ is related to the fluctuations 
of the random force. This is a simple example of a famous, fundamental theorem called the 
fluctuation-dissipation theorem . 

20-22. The Langevin equation can be extended to include the case of Brownian motion in 
an external field. In this case the Langevin equation becomes (in one dimension for simplicity) 


du 


ev 


m — = — y u — — 
dt r dx 


A\t) 


Show that the Fokker-Planck equation corresponding to this is 


/ e d 1 ev a\ 

U dx m dx du) 



where £ — y'jm and D u = IkTjm. 
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20-23. Extend the analysis of the previous problem to three dimensions. 

20-24. Show that the relation between the friction constant £ and the force correlation 
function (Problem 20-21) is also valid when there is an external force. 

20-25. Show that the fundamental solution to the multidimensional Fokker-Planck 
equation 


dW 

nr 




d 2 w 

dxt dxj 


is a multivariable Gaussian distribution with average values 

Oo> = x J0 e Xjt 

and variances 

<(*. - - <xj»> = - [I - e iX ‘ +XjU ] 


20-26. Let y(t) represent a random variable whose average value is zero. Furthermore, 
let y{t) be such that its statistical properties are independent of the origin of time. Such a 
process is said to be stationary. Define the frequency spectrum of y{t) to be 


G(o>) = f dr <Mt)y(t + r)> COS wr 
J o 

Convince yourself that this is independent of t. 
Show that if y(t) satifies the Langevin equation 

dy 

and 

<F(ti)F(ti)y = cS(t, - h) 


then the frequency spectrum of y(t) is 


CM 


c/2 


20-27. Tn this and the next few problems we shall illustrate several methods of solving the 
Fokker-Planck equation. For simplicity consider the one-dimensional case 


8W „ 

— -£K'+{o 


(It 

du 


IJcT d 2 W 
m du 2 


We wish to find the fundamental solution, i.e., the solution for no boundaries and for the 
initial condition 


W(u 9 1 ; w 0 ) = 8(w — Wo) t-> 0 

Except for the last term on the right-hand side of the Fokker-Planck equation, the equation 
would be a linear first-order partial differential equation, which is fairly straightforward to 
solve. It is natural to expect that a study of the first order-equation will give insight into the 
solution of the full Fokker-Planck equation. To this end, let d 2 lV/du 2 = 0 for now to get 


BW dW v 

ir-o-sr-w' 


Multiply this by an integrating factor /a(w, t) and compare the result to the perfect differential 
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Show from this comparison that we would have an exact differential if dt — fi 9 du^ — \iX>u 9 
and dW = from which one writes 

u = c t e~ lt and W — c 2 ^ f 

This suggests that the variables v = ue* and x = We~ it be used in the Fokker-Planck equa- 
tion. Show that this transformation gives 


dt m dv 2 


Show that this equation now becomes of the form of a simple diffusion equation if we 
introduce a new time variable r such that dt = e~ ut dr 9 or 

(e 2it - 1) 


2 £ 

Show that the fundamental solution to this diffusion equation is 

T (» — »o) 2 l 

x(p, r) = (4 nqr) 112 expl ^ j 

where q = tJcTjm. 

Thus finally show that the fundamental solution to the Fokker-Planck equation is 

f m 1 1/2 f m{u — w 0 e -c, ) 2 l 

W{u, t;u o) - [ 2 77/cT(l -«-«')] ® XP l 2kT(l -e" 2 ")] 

20-28. Again we consider the one-dimensional Fokker-Planck equation for simplicity: 

aw a a 2 w 

where q = £kT/m. 

Define the Fourier transform of W by 


r 00 

t ) = (2t t) ‘ l ' 2 J W(u 9 t)e ,su 


du 


Show that W(s 9 t ) satifies the equation 
dW 9 „ dW 

Show that the term in W can be eliminated by the substitution (see the next problem) 

V(s 9 t) = W{s 9 t)e* 2 ' 2 < 
to get 

dV dV 

~dt~ ~^ S l)s 

Show that this first-order linear partial differential equation has the general solution 
(I. Sneddon, Partial Differential Equations, New York: McGraw-Hill, 1957) 




where f(x) is a differentiable function of x. 
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Using the initial condition that 
W(u, 0) = S(u — uo) 
show that 

fr ( s , 0) = (2tt 
and that 



Lastly show that 

fris, t) = (27t) _ 1/2 expj^— ^ (1 — e ~ 2it ) -f iM/ 0 e” c, j 
whose inverse is 

* T £ l l/2 T ( u-uoe -") 2 

fr(u, t; Uo) - \2-nq(\ - e- 2 «)\ ® XP [ _ 2q(l - e" 2 «')/i 

20-29. Show that the substitution 
6 = <j>e~ ht 

reduces the equation 
dO d 2 0 


to the one-dimensional diffusion equation. 
20-30. Show that if 

dy 

then the fundamental solution to this equation is 

«-KH “ p [^ ipi] 

Note how this was used in Problem 20-27. 



CHAPTER 21 


THE TIME-CORRELATION 
FUNCTION FORMALISM, I 


The 1950s saw the beginning of the development of a new approach to transport 
processes that has grown into one of the most active and fruitful areas of nonequilib- 
rium statistical mechanics. This work was initiated by Green and Kubo, who showed 
that the phenomenological coefficients describing many transport processes and time- 
dependent phenomena in general could be written as integrals over a certain type of 
function called a time-correlation function. We shall see shortly that these time- 
correlation functions play a similar role in nonequilibrium statistical mechanics that 
the partition function plays in equilibrium statistical mechanics. Up to now we have 
presented no rigorous general formalism of transport processes, but we shall finally do 
so in this and the next chapter. 

Before considering any specific development, we shall first discuss the concept of a 
time-correlation function. We shall do this in the classical limit, but there is a quantum- 
statistical analog. Let p(t) and q(t) denote all the momenta and spatial coordinates 
necessary to describe our system, and let p = p{ 0) and q = q(0) denote the phase space 
coordinates at some initial time, t = 0. The p(t) and q(t) are related to the p and q 
through the equations of motion of the system. To emphasize this, we write 

p{t)=p{p,q\ t) 
q(t) = q(p>q i 0 

Let A{p(t ), q(t)} be some function of the phase space coordinates. By Eqs. (21-1) 
then, we can write 

A{p(t), q{t)} = A(p, q; t) = A(t) (21-2) 

We define a classical time-correlation function of A(t) by 

C(t) = <A(0)A(t)) =f fdpdq A(p, q; 0)A(p, q; t)f(p, q) (21-3) 

where f(p, q) is the equilibrium phase space distribution function and where dpdq 
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stands for dp x • • * dp N dq x • • • dq # . If A(t) is a vectorial function, such as velocity or 
momentum, then Eq. (21-3) becomes 

C(t) = <A(0) • A(f)> = J jdpdq A (p, q; 0) • A (p, q\ t)f(p, q) (21-4) 

For simplicity, let us consider the simple case in which A(t) is the velocity of some 
particular “ tagged ” molecule. Then the correlation function is 

C(t) = <v(0) • v(t)> (21-5) 

We first point out that in order to evaluate C(t) exactly even in this simple case, we 
would have to solve the equations of motion of one particle immersed in a system of 
others and then average the initial conditions over some equilibrium ensemble such as a 
canonical ensemble. Since v(t) depends upon the momenta and positions of many 
other particles in the system, this is clearly a very difficult calculation to do exactly. 
What, then, is the advantage of this time-correlation function formalism? 

One advantage is that the resulting formulas for the transport coefficients are 
quite general in the sense that they do not depend upon the details of any particular 
model and are not limited to any particular density region. For example, we shall 
show in Section 21-8 that the coefficient of self-diffusion can be expressed in terms of 
the above velocity time-correlation function by 

D = (21-6) 

J o 

This expression is valid for any density, for angle-dependent intermolecular forces, 
for polyatomic molecules, and generally for any classical system in which the diffusion 
is governed by the diffusion equation. Thus we see why we say above that the time- 
correlation function plays a somewhat similar role in nonequilibrium statistical 
mechanics that the partition function plays in equilibrium statistical mechanics. The 
analogy breaks down in one respect, however. Since the state of thermal equilibrium is 
unique, a single partition function gives all the thermodynamic properties, but since 
there are many different kinds of nonequilibrium states, we shall need a different 
time-correlation function for each type of transport process. For example, although the 
self-diffusion coefficient is given in terms of the velocity time-correlation function, 
other transport processes are given in terms of other time-correlation functions. Much 
of the discussion in this and the next chapter is concerned with determining what is the 
appropriate time-correlation function to use for a particular transport process of 
interest. 

Another advantage of this formalism is that even though it may be prohibitively 
difficult to calculate a time-correlation function exactly, we at least have exact equa- 
tions that we can start with to make sets of approximations. We use the same procedure 
in equilibrium statistical mechanics where we start with a partition function which is 
also prohibitively difficult to calculate exactly and introduce various well-defined 
approximations. The nature of a time-correlation function lends itself very nicely to 
this procedure. Consider the velocity correlation function [cf. Eq. (21-5)]. When 
t = 0, C(t) = <v(0) • v(0)>, which is simply the equilibrium average of v 2 , which equals 
3kT/m by equipartition. As time evolves, the particle will suffer collisions, and as we 
average over all these collisions, its velocity will change both in direction and magnitude 
and the velocity at time t , v(t), will be less and less correlated with its initial value 
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v(0). After a number of collisions, the velocity at that time will be completely uncorre- 
lated with the initial velocity, and C(t ) will equal zero. Thus we expect the velocity 
correlation here to start at its initial value C( 0) = 3kTjm and to decay to zero. It is 
not an uncommon approximation to assume that C(t) decays exponentially with some 
time constant r, i.e., to assume that C(t) = (3kT/m)ex^(— t/r). We shall see in Sec- 
tion 21-9 that an approximation of this form can lead to the Enskog theory of trans- 
port in dense hard-sphere fluids. Although we have discussed the velocity correlation 
function as an example here, the same argument holds for most any time-correlation 
function. 

It so happens that the time-correlation function formalism is more general than the 
above discussion implies. We have discussed its application only to thermal transport 
processes, but it is applicable to many other types of processes. A system that is 
subject to a time-dependent perturbation will produce a time-dependent response to 
this probe, which can be Fourier analyzed into what is called a frequency-dependent 
susceptibility. For example, a time-dependent external electric field acting across a 
conductor causes a time-dependent current to flow. The current depends upon the 
nature of the medium and the frequency of the field and can be described by a frequency 
dependent conductivity a(coi). We shall show in Section 21-4 that o(co) is given by 

<co) = L j”dt e~ ioi, (J(0)J(t)) (21 -7) 

where 

j 

where q } is the charge on the y'th particle and Vj is its velocity in the direction of the 
field. 

Note that this formula differs from Eq. (21-6) in that it contains the additional 
factor e~ i<ot in the integrand. [We call the integral in Eq. (21-7) a Fourier-Laplace 
transform.] Equation (21-7) is more representative of the type of formula that results 
from time-correlation function theory. Equation (21-6) for D implies that this expres- 
sion is the zero-frequency limit of some more general quantity. Thus we say that an 
ordinary thermal transport coefficient such as the self-diffusion coefficient is a zero- 
frequency result, meaning that it is valid only for slow processes. 

It is quite typical of the development to be presented in this chapter that we apply a 
time-dependent perturbation to a system and assume that this perturbation induces a 
time-dependent response which is linearly related to the perturbation. For this reason, 
the time-correlation function approach is often called linear response theory. 

We can also write time-correlation functions of two phase space functions, 
A{p(t) 9 q(t)} and B{p(t) 9 q(t)} 9 i.e., <^4(0)i?(0>. All of the results of this chapter will be 
of the form 

iKco)= Cdte- i “ t (A(<S)B(t)' > 

J o 

where is some generalized susceptibility. The familiar thermal transport coeffi- 
cients will be the zero-frequency limit of such expressions. When A = B 9 the correla- 
tion function is often called an autocorrelation function. 

In Section 21-1 we shall present the time-correlation function formalism of the 
absorption of electromagnetic radiation. We choose this as the introductory applica- 
tion since the derivation of the key formulas is quite pedagogical, beginning with the 
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standard formulas of quantum-mechanical time-dependent perturbation theory 
(which is assumed to be known to the reader at this point). In this section we show 
that the lineshape of an infrared absorption band is given as the Fourier-Laplace 
transform of the dipole moment autocorrelation function of an absorbing molecule. 
In spite of the pedagogical nature of this section, it does require a certain degree of 
quantum-mechanical knowledge. If the reader is not familiar with the golden rule of 
time-dependent perturbation theory, the Dirac bra and ket notation or the Heisenberg 
picture of time-dependent quantum mechanics, he is referred to Appendixes F to H. 
Section 21-2 is somewhat of a digression on the classical theory of light scattering. 
This serves as a background for the next section in which we present the time-correla- 
tion function formulation of Raman scattering. The mathematical and quantum- 
mechanical apparatus of this third section is very similar to that of the first section. 
In Section 21-4 we take a more general approach and present an elementary derivation 
of the basic formulas of linear response theory. In this derivation, the linear response 
nature of the theory will be clearly evident. Then in Sections 21-5 and 21-6 we give 
two concrete examples of the general theory of Section 21-4. The first is the theory of 
dielectric relaxation, which leads naturally into the next section, which is a further 
discussion of molecular spectroscopy. Section 21-7 contains a derivation of the basic 
equations of linear response theory, this time starting from both the classical and 
quantum-mechanical Liouville equations. Although no new results are derived in this 
section, the method is fundamental and serves to introduce several important concepts. 
Then, in Section 21-8 we discuss the time-correlation function formulation of 
the thermal transport coefficients, such as self-diffusion, viscosity, and thermal 
conductivity. We shall find in this section that the derivation of the basic formulas for 
the thermal transport coefficients does not proceed as smoothly as for the previous 
applications. The reason for this is that it is not possible (at least not easy) to account 
for the thermal transport phenomena by means of a well-defined mechanical perturba- 
tion Hamiltonian. Because of this, the basic time-correlation function formulas do not 
have the general validity that the previous formulas have. This is why Eq. (21-6) 
does not have the factor e~ l<ot that appears in Eq. (21-7). Lastly in Section 21-9 we 
present a number of approximate calculations of the thermal transport coefficients 
starting from their time-correlation function expressions. We shall show that the time- 
correlation function formulas reduce to the kinetic theory of gases formulas in the 
low-density limit, but, in addition, we show how they can be used to approximately 
calculate the thermal transport coefficients of dense fluids. We shall also discuss the 
law of corresponding states of transport processes in this section. 

21-1 ABSORPTION OF RADIATION* 

Consider a system of N interacting molecules in the quantum state i (for initial). Let 
the Hamiltonian of this A-body system be 3tf 0 , with ij/j = Ej ij/j . If this system now 
interacts with an electric field of frequency co, transitions into other quantum states / 
(for final) will occur if the frequency of the radiation is close to (E f — E^/h according 
to the Bohr relation. If we let the field be monochromatic, then we can write 

E(0 = E 0 z cos a>t = ^ (e icot + e~ iat ) (21 -8) 


* This section is based upon R. G. Gordon, Adv. Mag. Resonance , 3, p. 1, 1968. 
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where E 0 is the amplitude of the field and e is a unit vector along the electric field. 
We are assuming here here that the field is spatially uniform so that we can suppress 
the wave vector dependence that occurs in Eq. (10-79), for example. Since the field is 
uniform, or since the wavelength is large compared to molecular dimensions, the 
interaction between the field and the molecules can be written as 

^ (1) (0=-M E(t) (21-9) 

where M is the total electric dipole moment operator of the A-body system. 

According to the so-called Golden Rule of time-dependent quantum-mechanical 
perturbation theory (cf. Appendix F), the probability per unit time that a transition 
from the state i to the state / takes place is given by 

n r 2 

*W") = K/|e • M|i>| 2 [<5(ct) /i - co) + <5 (to fi + co)] (21-10) 

where a) fi = o f — co £ . If we multiply this by ha) fi9 this gives the rate of energy lost 
from the radiation in going from state i to /; if we sum this over all /, we get the rate of 
energy lost in going from the initial state i to any other state; and lastly, if we multiply 
this by Pi , the probability that the system was in the initial state, and then sum over all 
i, we finally get the rate of energy loss from the radiation to the system 

- ^rad = Z Z Pi hUfiPi-f 

i f 
nE 2 

= Z Z "/.-Pi I </| * • M I /> 1 2 [<5("/i - ") + <5("/i + ")] (21-1 1 ) 

In j i 

Since the summations i and / go over all the quantum states of the system, we may 
interchange these indices in the summation over the second delta function, giving 
(Problem 21-1) 

- £rad = Z Z "/.<Pi - Pf ) I </| * • M I i> 1 2 6(a> fi -(O) (21-12) 

If we assume that the system is initially in equilibrium, then 

Pf = (21-13) 

and so 

Pi -p s = pi (21-14) 

We can now substitute this into Eq. (21-12); we find that 

- £rad = ^ (1 - Z Z Pi I </l £ • M I O I 2 <5("/i -") (21-15) 

2 ft i f 

We have dropped the subscripts on the co’s since the delta function requires that 
a) fi = o). We shall define an absorption cross section a(co) such that this cross section, 
multiplied by the incident flux of radiation, is equal to the rate of energy lost from the 
field to the system. It is shown in Appendix I that the energy flux is given by the so- 
called Poynting vector S, whose magnitude in this case is 

s_ £i t£ ” 2 “5;" v 


( 21 - 16 ) 
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where v is the speed of light in the medium, c is the dielectric constant, c is the speed of 
light in vacuo, and n is the index of the refraction of the medium. If we divide Eq. (21- 
15) by S y then we obtain 


«(0J) = — co(l - £ Y Pi I </l E • M I i > 1 : 2 - ®) 

ficn f i 


(21-17) 


It is convenient to use this equation to define an absorption lineshape I(co) by 


'(-> * iJX ’-%0 - 3 I I A I </l « • M I i> I 8 *<■„ ■ - <») 


(21-18) 


As Gordon points out, this formula represents a Schrodinger representation of spec- 
troscopy as transitions between Bohr stationary states since Eq. (21-10) is derived from 
first-order perturbation theory in which the operators are independent of time and 
the wave function of the system varies with time. In the Heisenberg representation of 
time-dependent quantum mechanics, the time evolution of the system is placed in the 
operators, and the states of the system are considered to be independent of time. 
These two different representations arise from the fact that observable quantities 
occur as inner products of the form (x, Ai}/). We can ascribe the time variation of such 
products to a time dependence of the wave functions or to the operators. The two 
representations agree for all times if 


( X (o), Awm = wo, mm) 


(21-19) 


The correspondence between these two representations is shown in Appendix H, 
which briefly goes as follows. The time-dependent wave functions x(0 and \j/(t) obey 
the Schrodinger equation 


d\I/ 

ih = 
dt 


( 21 - 20 ) 


whose formal solution for a time-independent Hamiltonian operator is 
lK0 = exp|— 

= U(t)\j/(0) (21-21) 

with a similar equation for %(0- If these are substituted into Eq. (21-19), we find 

( X (o), Aim o)) = wo. mm) 

= (U(t)x(0)y A(0)U(t)il/(0)) 

= (x(0), U*(t)A(0)UM(0)) (21—22) 

U*(t) is the adjoint operator of U(t), which is simply exp (iJ^t/h) since 34? is Hermitian 
(Appendix H). Equation (21-22) suggests that we define the time-dependent operator 

^(0 by 

A(t) = e ijrtl *A( 0)e~ ijrt/h (21-23) 

Although we have explicitly displayed the zero-time argument in A in the right-hand 
side of the above equation, it is common practice to suppress it. Equation (21-23) is 
the so-called Heisenberg representation of an operator. We shall now see that the 
Heisenberg representation of Eq. (21-18) naturally leads to a time-correlation function 
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of spectroscopy. In addition, we shall see that this has the advantage of having a clear 
correspondence with classical mechanics, which is not apparent in the Schrodinger 
picture. 

We can convert Eq. (21-18) to the Heisenberg picture by introducing the Fourier 
transform of the Dirac delta function, 

8 (a))=-Lr e io *dt (21-24) 

27C * — co 

giving 

m = I • M l/X/I« • M| i> dt ex P [ (£/ ~ Ed - a>] t (21-25) 


Now the states | V) and | /> are eigenstates of the system excluding the radiation, so 
that (Problem 21-2) 


e -i£rf/*| i ^ = e -OPo»/*|^ 

(21-26) 

and 


</| e iEf,lf> = </| 

(21-27) 

Thus Eq. (21-25) can be written as 


m = dt e £ Pi<i|c • M|/></|e • M(t)| i> 

In J - oo i,f 

(21-28) 

where 


M(t) = 

(21-29) 

The summation over the set of final states can be removed by using the closure relation 
(Appendix G) 

Il/X/I =1 
/ 

(21-30) 

giving 


/(«) = ^r dt £ p,<i 1 a • M(0)e • M(t) 1 0 

Z71 J — oo £ 

(21-31) 

This summation over i is nothing but an equilibrium ensemble average, which we shall 
denote by < >, giving 

1(0}) = ^-C dt ■ M(0)e • M(t)> 

(21-32) 

Finally, for an isotropic fluid, one can show by averaging e 
(P roblem 21-3) that 

over all directions 

I(w) = ^- f dte~ iwf <M(0) • M(t)> 

27CJ-00 

(21-33) 


This is the desired result, namely, the lineshape function I(oS) has been written as the 
Fourier transform of the time-correlation function of the dipole moment operator of 
the absorbing molecules in the absence of the field! Thus we see that a knowledge of 
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the lineshape I(<o) can be used to study the motion of the dipole moments of the mole- 
cules by Fourier inversion of /(co). As we pointed out in the introduction, a correlation 
function describes the average decay of a property of a system from some initial value. 
In the present case, the property is the dipole moment. In general, one must consider 
the dipole moment to be that of the entire system, i.e. that of the N interacting dipole 
moments. Thus in the quantity 

m(o> • M(t) = (Jim) (j>(o) 

there are, in addition to terms of the form ^-(0) • cross terms between the dipole 
moments on different molecules, Because of these cross terms, one cannot 

simply interpret <M(0) • M (f)> in terms of the reorientation of a single dipole when the 
concentration of dipolar molecules is large. If, on the other hand, the dipolar molecules 
are dilutely dissolved in some nonpolar solvent, the cross terms are negligible, and the 
correlation function 

<M(0) • M(0> = ^J>/0) • = N(.m - K0> 

can be interpreted in terms of the orientation of the dipole moment of a single mole- 
cule. 

We shall return to an application of these formulas, but it turns out for experimental 
reasons that the interpretation of the dipole correlation function is often simpler in 
the case of rotation-vibration bands in the near infrared, for it is usually a good 
approximation to treat the internal vibrational motion of the molecules as essentially 
separable from the rotational and translational motion. If we also assume that the 
vibrational Hamiltonian of the various molecules is additive, then the A-body vibra- 
tional wave function is a product of the individual molecular vibrational wave func- 
tions, and by orthogonality of the various excited vibrational states of the molecules, 
cross terms between dipoles on different molecules vanish. This means that a vibra- 
tional excitation is localized on a single molecule, and so the correlation function may 
be considered to be that of vibrational transition dipole moment of a typical molecule. 
By a transitional dipole moment, we mean in classical terms the dipole moment 
whose oscillation leads to the vibrational absorption. In a diatomic molecule, this is 
simply the dipole moment of the molecule itself. For more complicated molecules, 
however, we mean each infrared-active normal coordinate whose oscillation produces 
an oscillating dipole. For a linear molecule such as C0 2 , the two infrared active 
modes are the antisymmetric stretch and the bending mode. Since in the first case 
the dipole moment oscillates along the axis of the molecule, the absorption band is 
called a parallel band, and since the dipole oscillates perpendicular to the axis in the 
bending mode, the absorption band due to this is called a perpendicular band. In this 
case the frequency a> in Eq. (21-33) is replaced by o 0 4- co, where o 0 is the vibrational 
band center and w is the frequency displacement from co 0 . The distribution of absorp- 
tion frequencies about the vibrational frequency a ) 0 , then, is the Fourier transform of 
the average motion of the particular transition dipole moment. 

If experimental data are available to extend over the entire range of rotational- 
translational frequencies about a vibrational band, then the dipole moment correlation 
function can be determined from the inverse Fourier transform, 

<m ’ n(0> = f /(<w)e‘“ r d(0 

J band 


(21-34) 
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It is usually convenient to normalize the correlation function to its initial value by 
<|i(0) • p(0)> = f I(co) do (21-35) 

‘'band 

By dividing Eq. (21-34) by Eq. (21-35), we get 


<u(0) • u(t)> = f l(co)e i0it dco 

* band 

where l(co) is a normalized Iineshape function 


(21-36) 


Hfo)= T 

Jband ^0*0 & 0 


(21-37) 


and u is a unit vector along the direction of the transition dipole moment. Figure 21-1 
shows the normalized dipole correlation function of carbon monoxide in various 
environments. The negative region in some of these correlation functions indicates 
that it is probable that the molecule swing its dipole moment to a direction opposite 
that which it had at t = 0. Such correlation functions occur in gaseous systems, where 



Figure 21-1. The carbon monoxide normalized dipole moment correlation function in various environ 
ments. x : CO in liqnid CHCI 3 ; O : CO in liquid CCU; + : CO in liquid «-C 2 H 6 ; A : CO 
in argon gas at 510 amagats; □ : CO in argon gas at 270 amagats; — : CO in argon gas 
at 6 amagats. (From R. Gordon, /. Chem. Phys. 9 43, p. 1307, 1965.) 
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the molecule has some ability to rotate. In more dense systems, however, such easy 
reorientation is not permitted, and the correlation function simply decays mono- 
tonically from its initial value of unity to the long time value of zero. 

Note that the procedure used in the derivation of the Iineshape function in terms of 
the Fourier transform of a time-correlation function is to start with the quantum- 
mechanical perturbation theory expression [Eq. (21-10)] and then convert that into a 
Fourier transform by straightforward manipulation. The above analysis was con- 
cerned with infrared band shapes, but it should be clear that most any perturbation 
theory expression can be manipulated in the same way. In the next section we present 
somewhat of a digression on the classical theory of light scattering, and then in Sec- 
tion 21-3 we shall continue this analysis and discuss Raman light scattering. 

21-2 CLASSICAL THEORY OF LIGHT SCATTERING 

Since light scattering, in general, is beginning to play an increasingly important role 
in chemistry, physics, and biophysics, we shall digress a little here to study the classical 
theory of the Raman effect. This is not only instructive in itself, but will serve as back- 
ground to Section 22-3, in which we shall discuss the general statistical mechanical 
theory of light scattering. 

In the ordinary scattering of electromagnetic radiation by an interacting system of 
molecules, the frequency of the incident radiation and the scattered radiation is the 
same; i.e., there is no frequency change associated with the scattering. This type of 
scattering is known as Rayleigh scattering. In addition to Rayleigh scattered radiation, 
it is found that there exists, to a much smaller extent, scattered radiation in which 
there are definite frequency changes. This type of scattering, predicted theoretically in 
1923 by Smekal and observed experimentally in 1928 by Raman, is called Raman 
scattering. The Raman scattered radiation has frequencies both higher and lower than 
that of the incident radiation; these differences, however, are independent of the 
frequency of the incident radiation (assumed to be monochromatic here for sim- 
plicity) and are characteristic of the scattering medium. It is common to define a 
Raman shift by 

Av = v f — v s (21-38) 

where v £ and v s are the frequencies of the incident and scattered radiation. As pointed 
out above, Av is characteristic of the scattering medium. The frequencies such that 
Av > 0 are called Stokes lines and those for which Av < 0 are called anti-Stokes lines. 
It turns out that the Stokes lines are often much more intense than the anti-Stokes 
lines. We shall now present a classical argument which shows that these Raman 
shifts are due to rotational and vibrational transitions occurring within the scattering 
medium. 

If a system of molecules is placed in an electric field, the electrons and nuclei will be 
displaced in such a manner as to induce dipole moments. If we make the valid assump- 
tion that the dipole moments induced, /x, are proportional to the strength of the 
electric field £, then 

H = aE (21-39) 

where a is the polarizability of a molecule. If E = E 0 cos 2nvt, then [i will vary with 
time according to 

fi(t) = a E 0 cos 2nvt 


(21-40) 
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Thus we see that the field will induce a dipole moment that oscillates with frequency v. 
According to classical electromagnetic theory, this oscillating dipole will emit radiation 
of the same frequency, and hence the result will be Rayleigh scattering in which the 
incident and scattered frequencies are the same. 

Now suppose that the molecules in the system are undergoing rotational and 
vibrational motions. For simplicity, let these be diatomic molecules. As the two 
nuclei vibrate along the internuclear axis, the polarizability of the molecule will vary. 
Since the amplitude of vibration is normally small, we can write the variation of the 
polarizability due to this vibration as 

a (t) = a 0 4- cxj cos 271V,, t (21 -41 ) 

where v v represents the fundamental vibrational frequency of the molecule and is a 
measure of how much the polarizability varies with vibration. If we substitute into 
Eq. (21-40), we see that 

MO = &o E 0 cos 2nvt 4- cos 2nvt cos 271V,, t 

a F 

= a 0 E 0 cos 2n vt 4- [cos 27i(v — v v )t 4- cos 27i(v 4- v v )t] (21-42) 

This equation shows us that the induced dipole oscillates not only with the incident 
frequency v, but also with frequencies v — v v and v 4- v„ . The Raman shifts here are 
Av = + v v and so are independent of v itself but characteristic of the molecules. 

Without going through the derivation, it should be clear that the rotational motion 
of the molecule can also lead to Raman scattering. As the molecule rotates, the 
orientation of the molecule with respect to the incident field changes, and if the mol- 
ecule has different polarizabilities in different directions, its polarization, i.e., the 
induced dipole moment, will vary as a function of time. Note that for there to be a 
rotational Raman effect, the molecule must have different polarizabilities in different 
directions. Unless the molecule has a very high degree of symmetry, this will be the 
case. For example, if the molecule is diatomic, there will be a different polarizability 
along the internuclear axis than perpendicular to it. Generally, the polarizability of a 
molecule is a 2nd-rank tensor whose nine components are given by 

Fx ~ a xx Ex + a XY E Y + a xz E z 

Fy = &yxEx + &yyE y 4 - &yzEz ( 21 — 43 ) 

Fz = a zx Ex + a z y Ey + a zz E z 
or in more compact notation 

p = aE (21-44) 

Just as in the case of the moment of inertia tensor discussed in Chapter 8, one can find a 
set of axes in the molecule such that 

Fi = «|£i 

fl 2 — <X 2 E 2 (21-45) 

F 3 = a 3 E 3 

Such axes are called principal axes of polarizability, and the associated polarizabilities 
are ^called the principal values of the polarizability. These principal axes are usually 
self-evident from the symmetry of the molecule. 
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Classical electromagnetic theory also tells us that the mean rate of radiation emitted 
over all directions by a dipole oscillating according to p = fi 0 cos 2nvt is (Appendix J) 


1 = 


167t 4 v 4 
3c 3 


Vo 


(21—46) 


In cgs units, / has units of ergs per second. Referring to Fig. 21-2, the rate of radiation 
emitted per unit solid angle in the ^-direction is given by* (Appendix J) 



2tt 3 v 4 


(ttOY 2 + VOZ 2 ) 


(21-47) 


Note that this gives Eq. (21-46) back if we average \i OY 2 and \i oz over angles to give 
H 0 2 /3 for each of them and then multiply by the total solid angle 4n. 

Light-scattering experiments are often performed with polarized incident radiation. 
Figure 21-2(a) shows incident radiation directed in the Y-direction and plane- 
polarized with the electric vector parallel to the direction of observation, i.e., along the 
Y-direction. The radiation scattered in the Y-direction will be depolarized, and the 
total scattered intensity in this case is designated by 7 r (obs. ||). Figure 21-2(b) shows 
the incident radiation to be plane-polarized along the Z-axis. The intensities in this 
case are / r ( obs. _L), /|j(obs. _L), and / x (obs. _L). In the first case, the incident electric 
field lies only along the Y-axis and so \i OY = a YX E 0 and \i oz = a zx E 0 , so that 
Eq. (21-47) gives 

27T 3 V 4 

/ r (obs. ||) = ^ 3 — (ct YX 2 + a ZX 2 )^Q 2 ’ (21-48) 


z 



X 


(a) 


z 



/i (obs. i) 


/ Direction of 
observation 


(b) 

Figure 21-2. Schematic representation of the experimental conditions defining the various scattering 
intensities, (a) The incident radiation is polarized along the Y-axis, (b) The incident radia- 
tion is polarized along the Z-axis. (From E. G. Wilson, Jr., J. C. Decius, and P. C. Cross, 
Molecular Vibrations. New York: McGraw-Hill, 1955.) 


* E. B. Wilson, Jr., J. C. Decius, and P. C. Cross, Molecular Vibrations (New York: McGraw-Hill, 1955), 
p. 43. Much of the material of this section comes from this reference. 
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Similarly, in the second case, in which the incident electric field vector is plane- 
polarized along the Z-axis 


Ijiobs. ±) 

2tcV 7 - 7 

— c 3 ( a YZ + K ZZ )E 0 

(21-49) 

/„(obs. ±) 

_ 2n 3 v* 2 £ 2 

(21-50) 

c 3 a ZZ ^0 

J x (obs. ±) 

27tV « 2 e 2 

— 3 a rz 

(21-51) 


Equations (21-48) through (21-51) are appropriate for a single molecule of fixed 
orientation. In order to apply these equations to a gas of N molecules, for example, we 
must average the <x AB 9 s over all angles. Note that the ol ab are the polarizability com- 
ponents in a spaced-fixed coordinate system. If we let <x AB be the components in a 
molecule-fixed system (see Fig. 21-3), then one set is given in terms of the other by 
(Problem 21^1) 

a AA ' = X ^ aa ' C - Aa ' ^ A ' a ' (21-52) 

a, a ' 

where the C* s are the direction cosines between the spaced-fixed and molecule-fixed 
axes. Recall that the direction cosines satisfy C xj 2 + <V + c z / = 1 and Cxi Cxj 
+ Cy,* Cyj + C Z £ C Z J = 0 (Problem 21-5). In case the three unprimed axes in Fig. 21-3 
are principal axes, then (Problem 21-7) 

«AA = i«jCAiC A .j (21-53) 

j= 1 

We wish to average « AA ' over all orientations, and so we write 


a AA' ~ ^X *j C Aj C A f jj 

= X a j 2 CAj 2 C‘A'j + 2 £ off a,- C Ai C A *i C AJ C A >j 

j *<J 


(21-54) 



Figure 21-3. The relation between molecule-fixed axes (primes) and space-fixed axes (no primes). The 
direction cosines of the Z'-axis with respect to the space-fixed coordinate system are 
C x x cos(i / , I) = r ■ i, Cx'y = cos(i\ j) - i' - j, and C x z = cos(i' * k) = i' • k with 
similar relations for the six other direction cosines. 
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To evaluate these averages of orientations, first note that 


C Aj * = cos 4 6 = — J d<p j dO sin 0 cos 4 6 = \ 

( 21 - 55 ) 

Now square the relation 


Cxj 2 + Cyj 2 + C ZJ 2 = 1 

( 21 - 56 ) 

average, and then use symmetry to get (Problem 21-6) 


3 <V + 6 C AJ 2 C 2 A .j =1 A + A' 

( 21 - 57 ) 

Using Eq. (21-55), we see that 


£ 

KJ 

$ 

II 

* 

( 21 - 58 ) 


To evaluate the averages involved in the second summation of Eq. (21-54), start 
with 


Cxi Cxj + C Y i C Y j + C Zi C Zj — 0 i^j 

(21-59) 

Squaring, averaging, and using symmetry give 


ICa^Caj 2 + 6Cai Ca-i Caj Ca'j “ 0 

(21-60) 

Using Eq. (21-58), we find that 


Ca( Ca'i Caj Ca'j = “To 1 ^ J A ^ A' 

(21-61) 

We can now use all of these results to evaluate the average of Eqs. (21-48) through 
(21-51) over all orientations. If we multiply by the density p of molecules in the gas 
and introduce the incident flux of radiation 7 0 = cE 0 2 /Sn , we find (Problem 21-8) 

J T (obs. ||) = 4 p/o(2£oc/ 2£>.-ocj) 

15 C \ j i<j / 

(21-62) 

J T (obs. X) = 4 pt Q U £ cc/ + £ cc.-ocA 

IOC 1 j i<j / 

(21-63) 

/|,(obs. X) — 15 c4 p/ 0 ( 3 I a / + 2 J£a, a,-) 

(21-64) 

The summations in these equations can be written in more 
the spherical part of the polarizability a, 

lucid form by introducing 

a = K a i + a 2 + a 3 ) = ^ Tr a 

(21-65) 

and the anisotropic part p. 


P = a — al 

(21-66) 

where 1 is the unit tensor. Note that (Problem 21-9) 


P 2 = Tr p 2 = ^[(2a, - a 2 - a 3 ) 2 + (2a 2 - a t - a 3 ) 2 + (2a 3 - a, - a 2 ) 2 ] 

= H( a i - a 2 ) 2 + (“2 - “s) 2 + (a 3 - a,) 2 ] 

(21-67) 


Note that for a linear molecule, the three principal values of the polarizability are 
ay, a x , and a x , where ay is the polarizability along the internuclear axis and a x is that 
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perpendicular to the internuclear axis. If we let the internuclear axis be the Z-axis, the 
a H i s a zz an d the a x are a xx and <x Y y > which are equal due to the cylindrical symmetry. 
Table 21-1 gives some values of cq, a 2 , and a 3 for a number of molecules. 

For linear molecules or, in fact, for symmetric top molecules as well (Problem 
21 - 10 ) 


a = 10*11 + 2 «x) (21-68) 

P 2 = K a || — a x) 2 (21-69) 

Values of a and p 2 can be calculated from Table 21-1. In terms of a 2 and p 2 then, 
Eqs. (21-62) through (21-64) become (Problem 21-11) 



16tiV 

p 2 


/ T (obs. ||) = 

c 4 

pi oy 

(21-70) 


167i 4 v 4 

r 45a 2 + Qp 2 


Ij{ obs. _L) = 

c 4 

Ph 45 

(21-71) 


16tt 4 v 4 

r 45a 2 + 6 P 2 


7 a (obs. X) = 

c 4 

O' ” 45 

(21-72) 


Table 21-1. Polarizability of molecules 



d • 10 25 
(cm 3 ) 

a, 10“ 
(cm 3 ) 

a 2 • IO 25 
(cm 3 ) 

a 3 • 10 25 
(cm 3 ) 

location of principal 
axes and direction of 
dipole moment; 
structure of molecule 

h 2 

7.9 

9.3 

7.1 

7.1 

ai symm. axis 

n 2 

17.6 

23.8 

14.5 

14.5 

cLy symm. axis 

Oz 

16.0 

23.5 

12.1 

12.1 

ai symm. axis 

Cl 2 

46.1 

66.0 

36.2 

36.2 

oli symm. axis 

HC1 

26.3 

31.3 

23.9 

23.9 

a! symm. axis 

HBr 

36.1 

42.2 

33.1 

33.1 

ai symm. axis 

HI 

54.4 

65.8 

48.9 

48.9 

oli symm. axis 

CO 

19.5 

26.0 

16.25 

16.25 

oti symm. axis 

co 2 

26.5 

40.1-41.0 

19.7-19.3 

19.7-19.3 

ai symm., linear 

so 2 

37.2 

54.9 

27.2 

24.9 

fjL — fj* 3 ; a 2 J_ to OSO- 
plane 

n 2 o 

30.0 

48.6 

20.7 

20.7 

cLy symm. axis 

HCN 

25.9 

39.2 

19.2 

19.2 

ol x symm. axis, linear 

H 2 S 

37.8 

40.4 

34.4 

40.1 

fx = [ 1 * 3 ; a 2 J_ to HSH- 
plane 

CS 2 

87.4 

151.4 

55.4 

55.4 

ai = symm., axis; linear 

nh 3 

22.6 

24.2 

21.8 

21.8 

ai symm. axis; //, = /u^; 
pyramidal 

ch 4 

26.0 

26.0 

26.0 

26.0 

Reg. tetrahedron 

c 2 h 6 

44.7 

54.8 

39.7 

39.7 

o ty symm. axis 

ch 2 =ch 2 

42.6 

(56.1) 

(35.9) 

(35.9) 

ay symm. axis 

CH=CH 

33.3 

51.2 

24.3 

24.3 

ol ± symm. axis, linear 

c 3 h 8 

62.9 

50.1 

69.3 

69.3 

ayj_ CCC-plane 

c 6 h 6 

103.2 

123.1 

63.5 

123.1 

_L plane of ring 

ch 3 ci 

45.6 

54.2 

41.4 

41.4 

fj, — fjLy ; ocy symm. axis 

ch 2 cj 2 

64.8 

50.2 

84.7 

59.6 

/Lt = /x 3 ; cx.y J_ CICCl-plane 

chci 3 

82.3 

66.8 

90.1 

90.1 

CLy symm. axis; fi = fiy 

CC1 4 

105.0 

105.0 

105.0 

105.0 

Reg. tetrahedron 


Source: J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Cases and Liquids (New York : 
Wiley, 1954). 
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A quantity that is often experimentally observed and reported is the depolarization 
ratio, defined as 

jzL°- b iJP = 9 P 2 (2W3) 

/ r (obs. J.) 45a 2 + 2 rP 2 


For future reference, we note that 

/ x (obs. 1 ) = / T (obs. 1 ) - /n(obs. 1 ) = pl 0 ^ 

depends only upon the anisotropic part of the polarizability ft 2 and that 
/ ||(obs. ±) - f/ x (obs. ±) = —^r~ plo* 2 


(21-74) 


(21-75) 


depends only upon the spherical average a. We shall see in the next section that 
/j | — f/ x and 7 X can be expressed in terms of the time-correlation function of a and p, 
respectively. 

Although this section is mainly a degression for preparation for the next section on 
Raman light scattering and later sections on the general theory of light scattering, we 
are in a position to discuss an interesting pioneering paper by McTague and Birnbaum* 
on the so-called collision-induced light scattering by dense rare gases. As Eq. (21-73) 
shows, for molecules that are optically isotropic, such as rare gas atoms, there should 
be no depolarization since the polarizability is isotropic, and in fact there should be no 
frequency shift since the scattering should be purely Rayleigh scattering. This is 
indeed so for single atomic scatterers, but consider the situation at high enough 
densities, e.g., 100 atm, where there are many binary collisions. As two rare gas atoms 
collide, they form what can be considered to be a transient homonuclear diatomic 
molecule. The collision process is essentially a transient vibrational motion, accom- 
panied by a polarizability change due to the distortion of the electronic structure 
during collision. This transient change in polarizability should lead to a transient 
anisotropic polarizability which, in turn, should lead to a depolarization of the incident 
radiation according to Eq. (21-74). This effect was first seen by McTague and Birnbaum 
who analyzed their experimental results essentially by the equations that we have 
presented above. 

They start with Eqs. (21-71) and (21-70), which say that for the case where the 
detector measures both the y- and z- polarized components of the scattered radiation, 
one has 


co 

I A obs. -L) = / 0 P — (a + to P) 

Is 


(21-76) 


and 


7j(obs. ||) — 7 0 p — 


co 4 /? 2 
c w 5 


(21-77) 


where co = 2nv. 

Figure 21-4 shows the experimental data for this scattering. Note that the intensity of 
the scattering is proportional to p 2 , suggesting that this effect is due to bimolecular 


From J. P. McTague and G. Birnbaum, Phys. Rev. Lett., 21, p. 661, 1968. 



Ifp (arbitrary units) 


CLASSICAL THEORY OF LIGHT SCATTERING 483 



0 2 4 6 8 10 


p (moles/liter) 

Figure 21-4. Density dependence of scattering intensity at two frequencies, z-poiarized intensity is 
indicated by open symbols, jr-polarized intensity by filled symbols. (From J. P. McTague 
and G. Bimbaum, Phys. Rev., Lett., 21, p. 661, 1968.) 

collisions. These expressions can be readily modified for this collision-induced light 
scattering by noting that in this case both a and f} 2 are functions of the interatomic 
distance r. Realizing this, we can modify Eqs. (21-76) and (21-77) by considering a 2 
and p 2 to be given by 

p r °° 

a 2 = ^ g(r)ot 2 (r)4nr 2 dr (21-78) 

2 Jq 

and 

P 2 = J f g(r)P\r)4nr 2 dr (21-79) 

^ J o 

where g(r) is the radial distribution function, which at these densities is simply 
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exp [—u(r)/kT], where u(r) is the intermolecular potential and the factor of \ in front 
of the integral is due to the fact that we are dealing with a collision between like atoms. 
Equations (21-76) and (21-77) become 

/ x (obs. ±) = 2nl 0 p 2 ^ j [a 2 (r) + P 2 (r)]g(r)r 2 dr (21-80) 

C 

/ T (obs. ||) = 2nI ° p0} f p 2 (r)g(r)r 2 dr (21 -81 ) 

jC J Q 

where a(r) and /?(r) refer to the polarizability increments relative to r = oo. 

McTague and Birnbaum find that the depolarization ratio 7 r (obs. ||)// r (obs. _L) is 
0.86 ± 0.04 for argon and krypton at frequencies of 10 and 15 cm -1 . Assuming that the 
Iineshapes for the incident x- and z-polarizations are the same (Fig. 21-2), these 
results show that <a 2 (r)> </? 2 (r)>, where < > indicates that average over r indicated 

in Eqs. (21-80) and (21-81). This shows that as two rare gas atoms collide, the aniso- 
tropic part of their transient polarizability change is much larger than the change of 
their isotropic or spherical part. These measurements serve as a very nice probe into 
the electronic structures of two colliding molecules. Levine and McQuarrie* and 
DuPre and McTaguef have presented simple theoretical calculations of the transverse 
and longitudinal polarizabilities of a pair of colliding rare gas atoms. 


21-3 RAMAN LIGHT SCATTERING} 


We begin with the quantum-mechanical expression for the differential scattering 
cross section for scattering into a frequency range do and a solid angle dil:% 


d 2 a 
do dd 


= K 4 E Pi\ <*l € ° * « * € s |/>| 2 <5(co - <o fi ) 
U 


(21-82) 


In this equation, 2 nX s is the wavelength of the scattered radiation; e° and e s are unit 
vectors in the direction of the incident and scattered radiation, respectively; a is the 
polarizability tensor of the group of interacting molecules; and as before, <o fi = 
(E f — Ei)/h. The polarizability tensor is given by 


a = 2e 2 Y <°l r l t, >< t, l r l°> 

v E v E 0 


(21-83) 


where r = Ir,-, and the summation here is over the coordinates of all the electrons. 
Although Eq. (21-82) is very similar in appearance to Eq. (21-10), it is, in fact, not as 
generally valid. The derivation of Eq. (21-82) involves a number of assumptions: The 
frequency of the incident radiation must be far from the molecular absorption fre- 
quencies (i.e., nonresonant scattering); the frequency shifts must be small compared 
with both the incident and scattered frequencies; the ground electronic state must be 
nondegenerate; and (£ min /fi) co 0 | <o 0 — coj , where E mln is the electronic excitation 
energy of the lowest optical transition. 

We can now manipulate Eq. (21-82) in much the manner that we used to derive 
Eq. (21-33). We introduce the Fourier representation for <5(co — co /£ ), use Eqs. (21-26) 


* H. B. Levine and D. A. McQuarrie, 7. Chem. Phys. t 49, p. 4181, 1968. 
t D. B. DuPre and J. P. McTague, J. Chem. Phys ., 50, p. 2024, 1969. 

% R. G. Gordon, Adv. Mag. Resonance , 3, p. 1, 1968; J . Chem . Phys., 42, p. 3658, 1965. 
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and (21-27) for e iEr ' 1 " \ /> and </ eliminate the summation over /by using 

closure, and finally get the analog of Eq. (21-31), namely, 

d 2 a 1 r°° 

= 2 ~j dt e- ,w <(e° ■ a(0) ■ e s )(e° • a(f) • e s )> (21 -84) 


As in Section 21-1, we have transformed the scattering cross section from the Schro- 
dinger picture into the Heisenberg picture. 

For a spatially isotropic system such as a liquid or a gas, we may average Eq. (21-84) 
over all (equivalent) directions £° and £*. The integrand of Eq. (21-84) contains the 
factor 

(*° • <°) • £S X«° • *(0 • «■) = (Z Z^iV/Ote/j^Z ( Z%°a*i(0«/ 5 j 

Z Z Z Z e i° £ j%% Sa ij(0M0 (21-85) 

1= 1 j~ 1 k= 1 /= 1 

and so we must evaluate the spherical average of efefs^ef. If we denote this average 
by < > S p h e r e, then Eq. (21-84) can be written as 


d 2 a 


dco dQ 2n 


ir 

2nJ- 


00 t t t <*i ( 21 — 86 ) 

00 1=1 j = 1 k= 1 1=1 


where it should be clear that the angular brackets without a subscript denotes an 
ensemble average. We shall consider separately the two cases in which £* is parallel to 
£° and £ s is perpendicular to £°. In order to do this average in the parallel case, we 
need to consider quantities (sj e/e k fi^Xphere* where is the fth component 
(a = x , y , z) of c° = c 5 and the notation O sp here denotes an average over the surface 
of a sphere. For example, the following special cases are elementary. 


<£ z ll £ z ,l £ z l, £ z ,, > = <cos 4 0> = f d(j) f dQ sin 0 cos 4 0 = y 

H7Z 

= <«AW>-<AW> 

<£ JC ,, € x ,, e z ,, e z ,l > = <sin 2 0 cos 2 <j> cos 2 0) = -ps 

= <£ y ,l €y ll € z ll € z 11 ^ = <€ Jc ll € JC ,l € y ll £ y ll > 

<£ Jc ll e y l, £ z ll £ z II > = 0, etc. 

These results can be written in the compact form 

(e, 11 e/ e* 11 e, 11 > sph ere = tVO^jAi + &ik &ji + &u $jk) 


(21-87) 

( 21 - 88 ) 

(21-89) 

(21-90) 


The corresponding calculation for the perpendicular case is a little more complicated 
since one must evaluate (e^e^e^e/X where e k x represents to kth component of £ s , 
which in this case is perpendicular to £°. One must first express the e k x in terms of 
and then perform the integration over the sphere (see Problem 21-12). The result is 


— To(45 afi S yd — 5 ay S fid — S a6 S fiy ) (21-91 ) 

If we now substitute these results into Eq. (21-86), we get (Problem 21-14) 


, ■ i L *■ 5 <■**•<«> + (2 ’- 921 
4^), - s C * *'“ s 5 <3M0K ' <0 “ M0)a ' j(,)> 


(21-93) 
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These can be rewritten as 


( d^er \ 1 r 00 

d-rfn) |( = 30 ^ J dt e "'“'< 2 Tr “(°) ’ “(0 + [ Tr «( 0 )][Tr <*( 0 ]> (21-94) 

± = (Lf dt Tr “ (0) ' “ (0 " [Tr “ (0)][Tr “ (0]> (21 ~ 95) 

At this point it is convenient to decompose the polarizability tensor into a spherical 
part oil and a traceless anisotropic part p 

a = al+p (21-96) 

where I is the unit tensor, 

a = + <*2 + “a) (21-97) 

and 

Tr P = 0 (21-98) 

Substituting this into Eqs. (21-94) and (21-95) gives (Problem 21-15) 


X A 

X A 



tV <15a(0)a(0 + 2 Tr P(0) • P(t)> 
- i “' 1 L<Trp(0)-p(0> 


(21-99) 

(21-100) 


Note how these equations compare with Eqs. (21-72) and (21-74). Following through 
to Eq. (21-75), this leads us to define 


d 2 a 


pol 


dcodQ 


and 


so that 


d 2 a 


dcpol 


dco d£l 


_ / d 2 a \ 4 / d 2 a \ 

= \do}dn) }{ ~3 \dcodn) ± 

\dcodn). 




1 r” J 
= Tm J dt 


— icot / — 


dco dQ 2n 


<a(0)a(t)> 


and 


d 2 a 


depol 


dcodQ 2nJ_ 


-if dt e~ iwt <Tr P(0) ■ P(t)) 


(21-1 01) 


( 21 - 102 ) 


( 21 - 1 03 ) 


( 21 - 104 ) 


If the scattering process produces no change in the vibrational state of the molecules, 
then the polarized component above corresponds to Rayleigh scattering, and the 
depolarized component corresponds to the conventional rotational Raman scattering. 
In the vibrational Raman effect, on the other hand, when the scattering process 
produces a vibrational transition, the polarized component corresponds to a polarized 
g-branch of the vibrational band, while the depolarized component corresponds to 
the anisotropic part of the vibrational band. This becomes an important matter 
only when one is actually analyzing spectroscopic data and involves a greater degree 
of spectroscopic knowledge than is necessary here. 
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The depolarized component turns out to be particularly interesting. As in Section 
21-1, it is convenient both theoretically and experimentally to define a normalized 
intensity distribution 


;<<») 


i— 1 

XthtdQJ L 


O ( d2 ° \ 

J \d(odSi) ± 


(21—1 05) 


dco 


and a normalized correlation function, which decays from unity at t = 0: 
<Tr P(0) • p(0> 


C(t) 


<Tr P(0) ■ P(0)> 


(21-1 06) 


The normalized band shape and normalized correlation function are related through 
the Fourier transform: 


1(0})= hf c ® e ~ ia ' dt 


(21-107) 


C(t) = f l((o)e la ' d (0 (21-108) 

“ — 00 

The form of £{t) becomes quite simple for a totally symmetric vibration. A totally 
symmetric vibration in a linear or symmetric top molecule maintains the symmetry 
of the molecule. For a totally symmetric vibration, the anisotropic part of the polar- 
izability has the form 


Pij = constant^ u s - iS u ) (21 -1 09) 

where tq is the ith component of a unit vector fixed along the symmetry axis of the 
molecule. Thus 

Tr P(0) ♦ P(t) = t i W)uj(0) - i SJluMujit) - J SJ 

i=i ^=1 

= 1 I [w i (0)w J -(0)w £ (0w J -(0] ~ i (21 -1 1 0) 

i=l ;=1 

and the normalized correlation function C(t) is 

C(t) = (21-1 1 1 ) 

In the classical limit, w(0) and u(t) commute, and so C(t) can then be written in the form 

C(t) = <F 2 [u( 0) - u(t)]> (21-112) 

where P 2 (x) is a Legendre polynomial, P 2 (x ) = i(3x 2 — 1). Remember that u here is a 
unit vector pointing along the symmetry axis of the molecule. Thus we see that the 
depolarized component of a Raman spectrum can be Fourier-inverted to give the 
time-correlation function of the rotation of the molecule in terms of F 2 [u(0) • u (*)]. 
Figure 21-5 shows such a correlation for liquid nitrogen. Values of the correlation 
function at times longer than those indicated in the figure are unreliable because of the 
fairly low resolution in the experimental Raman data used. 
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Figure 21-5. The rotational correlation function, C(t) </’ 2 (u(0) - u(/)», for liquid nitrogen. In (a), 
C(t) is plotted on a linear scale, and in (b) it is plotted on a logarithmic scale. (From R. G. 
Gordon, /. Chem. Phys. 9 42, p. 3658, 1965.) 

Note that in the case of infrared absorption treated in Section 21-1, the normalized 
correlation function can be written in terms of a Legendre polynomial of first order, 
Pi(x) = X, or 

C(t) = <^[ 11 ( 0 ) • u(t)]> 

We see that infrared and Raman spectra determine different measures of molecular 
rotations. Figure 21-6 shows the two different time-correlation functions for liquid 
methane at 95°K. Gordon* presents an interesting interpretation and discussion of 
these two correlation functions. 




Figure 21-6. (a) The dipole correlation function, <.Pi(u(0) - u(/))> = <u(0) • u(/)> for liquid methane at 
98°K. (b) The rotational correlation function, </ > 2 (u(0) • u(/))> for liquid methane at 95°K. 
(From R. G. Gordon, J. Chem . Phys., 43, p. 1307, 1965.) 


* R. G. Gordon, /. Chem . Phys. t 43, p. 1307, 1965. 
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The two effects that we have considered thus far, infrared and Raman spectra, are 
both developed in the same manner. One starts with the quantum-mechanical expres- 
sion for the appropriate cross section in the so-called Schrodinger picture and then 
transforms it into the Heisenberg picture, which ends up in the form of a Fourier 
transform of a time-correlation function. The corresponding experimental spectrum 
can then be numerically inverted to give the time-correlation function itself as a 
function of time, and this can be used to give much insight into the molecular motion. 
From the two cases studied so far, we can learn about the molecular orientation as a 
function of time, both in the gaseous and liquid states. 

We could proceed at this point to investigate other applications of this approach. 
For example, neutron scattering in liquids can be used to probe the time dependence 
of the translational motion of molecules in liquids. One starts with the appropriate 
cross section, which in this case is the standard Born approximation for the differential 
scattering cross section of a collision of a neutron with an atom and then manipulates 
this expression into the Heisenberg picture and hence a time-correlation function. We 
shall actually do this in Section 22-1, but now we shall present in the next few sections 
a more general and abstract formulation of the time-correlation function approach to 
molecular motion from which the results obtained so far are special cases. 


21-4 AN ELEMENTARY DERIVATION 
OF THE BASIC FORMULAS 

In this section we shall present an elementary derivation of the basic formulas of the 
time-correlation function formalism. This derivation is not only elementary (in the 
sense that it does not require any specialized mathematical techniques such as operator 
calculus), but also displays a simple one-to-one correspondence between the mathe- 
matical steps in the derivation and the steps in the performance of an experiment. This 
derivation is due to Zwanzig, whose 1964 review* still serves as an excellent introduc- 
tion to this subject. For concreteness only, we shall consider the case of electrical 
conductivity, although the formalism itself is quite general. 

In an experiment designed to measure the electrical conductivity of a substance, one 
applies an electric field across the system and measures the electric current that flows in 
response to this field. Usually this experiment is repeated a number of times, and the 
results, the measured currents, are averaged. If we denote the electric current by J, the 
conductivity (tensor) by a, and the applied field by E, then the conductivity is defined 
by the relation 

< J> = a • E + 0(E 2 ) (21-1 1 3) 

We are assuming here that the field is small enough that nonlinear effects are of no 
importance; i.e., we are assuming that Ohm’s law is valid. This is an excellent assump- 
tion under normal conditions. 

Equation (21-113) does not indicate that the quantities involved, particularly the 
conductivity, may be functions of the frequency of the applied field. As it stands, Eq. 
(21-113) implies a steady or dc conductance. In order to determine the frequency 
dependence of the conductivity, one uses a time-dependent electric field. This is turned 


* R. Zwanzig, Ann, Rev . Phys. Chem . , 16, p. 67, 1965. 
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on at t = 0, and then one measures the average current which will now, of 

course, be a function of time. In this case, Eq. (21-113) reads 

< J(0> = ( - O • E (O + 0(E 2 ) (21 -1 1 4) 

J o 

The (tensor) function 0>(t — t') here is called the after-effect function and shows that, 
in general, there may be a delay between the field and the current induced by the 
field. Values of the field at times earlier than t affect the average current at t . The after- 
effect function represents a time lag of the system. If the system can respond instan- 
taneously to E(f), then O is a delta function. Ordinarily, however, the system lags 
behind E(f), and O(f) is a monotonically decaying function of time. 

The integral in Eq. (21-114) is of the form of a Laplace transform convolution. If 
we take the Fourier-Laplace transform of Eq. (21-114), then we get 


3 

? 

II 

/s 

N/ 

(21-115) 

where 


<j->= fW tof <j(o> 

J o 

(21-116) 

E a = Cdte-^m 

J o 

(21-117) 

and, most importantly for our purposes. 


a(o))= dt e~ itat <S>{t) 

J o 

(21-118) 

This gives us the frequency-dependent conductivity of the system under study. 

Although we are focusing on the case of electrical conductivity, it should be clear at 
this point that the development presented thus far is general. If we let B be the quantity 
that responds to the external field F, then Eq. (21-114) becomes 


(21-119) 

where </> is the after-effect function for this particular process, 
becomes 

Equation (21-115) 


(21-1 20) 


where x(g>) is called the (frequency-dependent) susceptibility of the system. 

We shall now show how to express the frequency-dependent conductivity or sus- 
ceptibility, in general, in terms of a Fourier-Laplace transform of a time-correlation 
function. Following Zwanzig, we note that there are three stages to an experiment 
designed to measure electrical conductivity. First, the system must be prepared; then 
the field is turned on; and then the current is measured. Each of these stages has an 
analog in the following derivation. 

In the first stage the experimenter may select a length of wire from a spool, and then 
he repeats this process a number of times. In a statistical mechanical sense he is select- 
ing samples from a thermal equilibrium (canonical) ensemble. Let be the Hamil- 
tonian operator for the entire V-body sample, with corresponding eigenfunctions and 
eigenvalues given by 


^0 — En tyn 


( 21 - 1 21 ) 
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The probability of observing the system in the state n is given by 

g—fiEn g fiEn 

Pn = £ e -eE n Q~ ( 21 — 1 22 ) 

n 

The average current, or response in general, is found by calculating the response in 
some particular eigenstate n and then averaging over the canonical ensemble distribu- 
tion of eigenstates. 

In the second stage of the experiment, the field is turned on. This field will interact 
with the total dipole moment M of the system and add a term of the form 

JP x (t) = -M • E(t) (21-1 23) 

to the unperturbed Hamiltonian 0 - As the notation E(t) suggests, we are assuming 
for simplicity that the field is uniform over the system, i.e., does not depend upon r. 
Recall that the total dipole moment operator is 

M = 'Ze J r J (21-124) 

j 

where the summation runs over all charges in the system. If we let the field be directed 
in the ^-direction, then 


= -M a E a (t) (21-125) 

The presence of the applied time-dependent field will alter the state of the system, 
which now must be governed by the time-dependent Schrodinger equation 

d'F 

ih — = + tf x {t)\¥(t) (21-1 26) 

ct 

with the initial condition that ¥(0) = with probability p n . If J^ x (t) is small, which 
is so here, it is a standard problem in time-dependent perturbation theory to show that 
to first order in the applied field strength (cf. Appendix F) 

¥(0 = (21-127) 

m&n 


where 

c m (0 = — f ( dt’ (21-128) 

J o 

co n = EJh, and h is set equal to unity. 

In the third and final stage of the experiment, one measures the current. In the 
statistical mechanical theory, this corresponds to calculating the expectation value of 
the current operator, first with respect to T(t) and then with respect to the canonical 
distribution of initially prepared states «. The component of the current in the 6th 
direction is 

Jb = Y e i v jb = Z e j Ofc = M b 

j j 


(21-1 29) 
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where the notation v jb represents the 6th component of the velocity of the jth charged 
particle. The expectation value of J b over x ¥( t ) is 

J ¥ W* *(0 <*»" = <« I J*l«> 

m&n ^0 
m*n ^ 0 

+ 0(E 2 ) (21-130) 


= (Jf) m 

+ I i f ' dty b ) n JiMXne- i{a --' aM, -'' ) E a (t') 

m±n J 0 

I i [' dtXM a ) nm (J b ) mn e- i ^’'- (0mH, -°E a (t') 

m*n J 0 

+ 0(E 2 ) (21 —1 31 ) 

where we have introduced the more compact notation ( A) nm = (n\A \ m). 

This is the expectation value of J a for a given initial state n. We now canonically 
average over n by multiplying by p n and summing over n to get 

< = < 4>. q + (df bjt - t')E a (t') + 0(E 2 ) (21 -1 32) 

J o 

where </ fc > eq is the average current in the 6th direction at equilibrium 

<4>eq =Z P n (Jfc)nn (21-133) 

rt 

Since there is no current before the field is turned on, < J b ) eq = 0. The after-effect func- 
tion indicated in Eq. (21-132) is given by 

= £ I ipJLiJtXJMXne-^-^ ~ (21-1 34) 

n m±n 

We can use the Heisenberg equation of motion to write the matrix elements of J b in 
terms of those of M a . The Heisenberg equation of motion for the operator M is 


ih 


dM 

~di 


= ih3 = [M, 3>e\ 


(21-1 35) 


Since we are considering only a linear relation between the current and the field, 
<p ba must be independent of the field, and so (Problem 21-16) 

= «M, JToBu, + 0(E) 

= (M JT 0 ) n „ - (X 0 M) mn + 0(E) 

= £„(M) mn - £ m (M) mn + 0(E) 

= (E„ ~ EJ(M) mn + 0(E) (21 -1 36) 

In addition, it is easy to show that (Problem 21-17) 

[3(t)] mn = e-^"-^<(J) mn + 0(E) 


(21-137) 
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We can use these last few relations to derive several equivalent formulas for <f) ba (t ). 
If we substitute Eq. (21-137) directly into Eq. (21-134), we see that 

<M0 = 4 Z Z PnW a ) nm im\rn n - [JMUKW 

Ml rt m&n 

= ^ Z p n m«m\ m - [mM a u 

= ^ Tr{p[M a ,J b (t)]} (21-138) 

We have used the fact that [J b (t)] nn = ( J b ) nn = 0, according to Eq. (21-136). 

Another standard alternative expression for <j> ba (t ) can be derived from Eq. (21-138) 
by noting that 

Tr (ABC) = Tr (BCA ) = Tr (CAB) (21 -1 39) 

which says that the trace is invariant to a cyclic permutation of the operators or 
matrices involved. This allows us to write 


= -Tv{p[M a ,J b (M 

(21-140) 

= 1 {TrlpMMm ~ Tr[pJ b (t)M a }} 

(21-1 41) 

= 1 {Tr[pM a J b (t)] - Tx[M aP J b m 

(21-142) 

= I {Tr[p, M a ]J b (t)} 

(21-143) 


Equations (21-140) and (21-143) are two standard expressions for <p ba (t). 

We can use Eqs. (21-138) and (21-136) to derive one other expression for <j> ba (t ) 
(Problem 21-26) 


4>ba(t) = Z Z pJtJJJLmi 




E„-E m 


(21-144) 


This can be written in a compact form by defining a so-called Kubo transform by the 
relation 


j a = - 0 dXe x *°J a e- x *° 

P J o 

The matrix elements of J a are (Problem 21-28) 

(h) mn = \( dXe x ^-^\j a ) mn 

p J o 

= n E m ~E n J (Ja)mn 


(21-145) 


(21-146) 
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and so Eq. (21-144) can be written as 

I I pJLhXAmu 

n m&n 

= pTr[pJ a J b (t)] 

= Khmy ( 21 - 147 ) 

This last form is most convenient for taking the classical limit since in the classical 
limit the Kubo transform of an operator A becomes simply the classical expression for 
A (Problem 21-20). The classical limit of the other two expressions for <f> ba (t) is more 
difficult to obtain since both the numerators and denominators go to zero as h -* 0 
(Problem 21-19). Since Tr pA corresponds to an integral over phase space in the 
classical limit, we see that 

Tr [pAB(t)\ J | dp dqf 0 (j>, q)A(0)B(t) 

where 

e -fi* ) 

P = Tr = J . . . | dp dq e ~pn(p,q) 

Similarly, since the quantum-mechanical commutator corresponds to the classical 
mechanical Poisson bracket, then we also have 

^ Tr[{[p, A]B{i)} *$■■ ■ f dp dq(f 0 , A)B(t) (21 -1 50) 

^ Tr{p[A, £(*)]} *-+ j--- jdp dqf 0 (A, B(t )) (21 -1 51 ) 


(21-148) 

(21-149) 


where (A, B(t )) here denotes the Poisson bracket notation 


(A, B ) = 



dA 8B 
dq t dpi 


dAdB\ 

dp, dqj 


(21-152) 


Although we have obtained these classical limits essentially by inspection, they can 
be derived directly from the Liouville equation. We shall do this in Section 21-7 after 
we have discussed a few applications of the general formalism derived here. 

In summary, then, we have derived an expression for the frequency-dependent 
conductivity in terms of a Laplace-Fourier transform of a time-correlation function: 


a bo {w) = pfdte-^\j a J b {t)y 

J o 


(21-153) 


Although we have used the example of electrical conductivity for concreteness, the 
above derivation is quite general. It is straightforward to show (Problem 21-29) that if 

^(f) = ~AF(t) (21-154) 


and if the response is denoted by B , then 

<*(*)> = (df <Kt - t')F(t') + 0(F 2 ) (21-1 55) 

J o 

and 


= tfp>)F a 


(21-156) 
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where 


*(<£>) =f dte '“'<£(0 (21-157) 

J o 

The quantity x(co) is the frequency-dependent susceptibility of the system. The equiva- 
lent forms of 4>(t) corresponding to Eqs. (21-143), (21-138), and (21-147) are 



( 21 - 158 ) 

= ±Tr{p[A,B(t)]} 

( 21 - 159 ) 

= pTr[pAB(t)] 

( 21 - 1 60 ) 

where the tilde over A denotes the Kubo transform 


A=- 0 f dkJ*Ae~** 

p J o 

( 21 - 161 ) 

Problem 21-22 investigates some of the properties of For example, it is shown 

there that </>(*) is real and odd if A = B. 

It may not be obvious at this point that the spectroscopic applications of Sections 
21-1 and 21-3 are readily derived from this formalism. In the next two sections we 
shall show that this is so. 


21-5 DIELECTRIC RELAXATION 

In this section we shall discuss the behavior of a set of noninteracting dipoles in a 
time-dependent external field. At low enough frequencies, the orientation of the dipoles 
can keep up with the instantaneous value of the field, but as the frequency increases, 
the dipoles start to lag behind the field. This time lag is commonly referred to as 
dielectric relaxation, and we shall see below that it is responsible for an absorption of 
energy from the external field. 

Consider a nonconducting isotropic polarizable gas and an externally applied 
electric field E(t), which we take to be in the z-direction. Since the system is non- 
conducting, this electric field induces a net dipole moment in the system. The function 
B(t) of the previous section, then, is the total dipole moment M z (t). The Hamiltonian 
of the system is 

=3tf 0 - M z E z (t) (21 -1 62) 

where 3f 0 is the Hamiltonian of the entire system in the absence of the electric field, 
and so the function A of the previous section is M z . The susceptibility x(cS) is given by 

X(a>)=fe- i<M (Kt)dt (21-163) 

J o 

with 

m = I Tr {p[M z , Mjm (21-1 64) 


= pTr[pM z M z (t)\ 


(21-165) 
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The susceptibility can be expressed in terms of the frequency-dependent dielectric 
constant. If P is the polarization of the gas, i.e., the total dipole moment per unit 
volume, then 

P=X(co)E (21-166) 

Since D = E + 4rcP and D = eE, we have the relation between the susceptibility and 
the dielectric constant: 


e(co) = 1 + 4nx(cS) 


(21-1 67) 


Notice that since 4>(t) is real (Problem 21-22), x(&0 defined by Eq. (21-163) is a 
complex quantity. This means that e(co) is formally complex also. Of course, physical 
quantities are real and not complex, but it is often convenient to define certain functions 
to be complex and then identify the real and imaginary parts with measurable quanti- 
ties. For example, if one introduces a harmonic electric field e™* into the equations of 
electromagnetism, one is led quite naturally to define a complex dielectric constant 
whose imaginary part is directly related to the absorption of electromagnetic radiation 
by the system. This is discussed in some detail in Appendix K. It is shown there that 
if we write e(c o) as 

e(a>) = e'(co) — ie”(co) (21— 1 68) 

then the imaginary part e"(co) is given by 




nccc(co) 

co 


(21-169) 


where n is the index of refraction, c is the speed of light, and a {co) is the absorption 
coefficient of Lambert’s law, / = / 0 exp(— oot). Since £"(&>) represents the absorption 
by the system it is often called the dielectric loss. Substances with large values of 
e"(a>) are said to be “lossy” in that frequency region. 

Now let us consider the classical limit of Eq. (21-164) for a dilute gas. We can 
ignore the tilde denoting the Kubo transform in this limit and write 

m = /KM Z ( 0)M Z (0> 

= 0Z</izi(OK-(O> (21-170) 

i.j 

In a dilute gas, we can neglect those terms here in which i i.e., we can assume that 
the dipole moments on different molecules are uncorrelated. Thus we can write <j>(t) as 

cKt) = PN<fi z (0)ti z (t)y (21-171) 

We can write </} z (0)jx z (f)> as the time derivative of </i z (0)/x z (f)> by means of the 
following set of manipulations. First, we must recognize that since the angular brackets 
in the expressions for time-correlation functions denote an equilibrium ensemble 
average, we can write that 


(A{0)B(t)y = <A(s)B(t + s)> (21-172) 

An average that satisfies this condition is said to be stationary. Physically, it means 
that there is no preferred origin of time in a system that is in equilibrium. In particular, 
if we let s = — 1 9 

<A(0)B(t)>=<A{-t)B(0)y 
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Using this, we can write 

<Az(0K(t)> = </iz(-0^(0» 

= - j t <^(-0a*z(0)> 


= - j f <X(o K(0> 


(21-173) 


The reader should get used to the kind of manipulations that lead to Eq. (21-173). 

In order to calculate the time-correlation function, we would have to solve the 
equations of motion of the gas to find fi z (t), which we would then canonically average. 
This is certainly possible in the case of a dilute gas, but becomes a formidable task for 
dense systems. Fortunately, time-correlation functions have a physical interpretation 
which allows one to devise sensible approximations. The dipole moment time-correla- 
tion function is easy to picture physically. It is a measure of the correlation of fi z (t) 
with }i z ( 0). Initially it is </* z0 2 >, but as the molecule suffers collisions with its neighbors, 
this correlation decays to zero. The molecule no longer remembers the initial direction 
of its dipole moment. The Debye theory of dielectric behavior is obtained by simply 
assuming that <A* z (0)/z z (f)> decays exponentially from its initial value, i.e., 

=^o 2 e _|,|/r (21-174) 


where t is a measure of the duration of the correlation. The time t is called a relaxation 
time. If we substitute Eq. (21-174) into Eq. (21-173), this result into Eq. (21-170), and 
that into Eq. (21-163), we get 


X(o>) 


/W 



cos c ot 



■r/t 


sin c ot 




(21-175) 


-a 2 ( 1 ion ' 

~ PPo \1+co 2 t 2 1+coV, 

This gives 

£ '('“)- 1 -4”^ 2 (rrW) 

'■(“)■ 4 ”^" 2 (r^v) 

These are the famous Debye equations. They show that e'(co) decreases from e'(0) 
to unity with increasing frequency. Most of this decrease occurs within a fairly small 
frequency range centered about co = 1/t. Similarly, e"(o>) changes from a small value 
through a maximum to a small value again. This maximum occurs at co = 1 /t. Physi- 
cally, as the frequency of the external field approaches 1/t, the dipoles start to lag 
behind the field, and eventually the frequency becomes high enough that they essentially 
do not respond to the field at all, and hence e'(c o) 1. Equation (21-178) shows that as 
the molecules begin to lag behind the field, they absorb energy from the field, and 
continue to do so until the frequency becomes so high that the dipoles no longer 
respond at all. 

Figure 21-7 shows experimental data for iso-butyl bromide. The agreement between 
theory and experiment is seen to be very good in this case. It should be pointed out. 


(21-176) 

(21-177) 

(21-178) 
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100 30 10 3 I 0.3 


Figure 21-7. Dependence of e'{uS) (A) and e"(a>) (2?) of /-butyl bromide npon the logarithm of the wave- 
length (cm). The solid lines are the Debye curves, and the dashed lines are drawn through the 
experimental data. (From E. J. Hennelly, W. M. Heston, Jr., and C. P. Smyth, J. Amer. 
Chem. Soc. 9 70, p. 4102, 1948.) 


however, that this is somewhat exceptional. In many cases the dispersion occurs over a 
wider frequency range, with a maximum value of e"(co) lower than that predicted by 
Eq. (21-178). This effect has been attributed to the fact that there is actually a number 
of, or a distribution of, relaxation times, rather than just the one as implied in 
Eq. (21-174). The reader may notice in Fig. 21-7 that e f (co) levels off between 2 and 3 
rather than unity. This is simply due to the fact that there are other mechanisms that 
contribute to the dielectric constant at the frequencies given in the figure. 

Before leaving this section we wish to discuss an important relation between 
e'(co) and e"(c o) or, more generally, between /(&>) and x\co). This connection, called 
the Kramers-Kronig relation, is 


1 r ' (o' d(o‘ 

X'(") = - J j — - 

71 CO — CO 

2 r 00 CO do' 

x»=— f xV)-r—7 

71 J 0 0 —0) 


( 21 - 1 79 ) 


( 21-1 80 ) 


The Kramers-Kronig relations show that x'(co) and x"(co) are not independent quanti- 
ties; a knowledge of one implies a knowledge of the other. 

The proof of the Kramers-Kronig relations goes as follows. Since 


*(G>) = x'M - ix"(<u) = [ fi 

J A 


( 21 - 1 81 ) 


we have 


r 00 

X'(co) = cos c ot dt 


( 21 - 1 82 ) 
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and 


r 00 

x(co) = sin cot dt 

J o 

Equations (21-182) and (21-183) can both be inverted to give 

(21-183) 

2 r 00 

<j){t) = - x'{co) cos cot dco 

71 J o 

(21-184) 

2 r 00 

= - x'X 0 *) dco 

* J 0 

(21-185) 


To prove Eq. (21-179), we substitute Eq. (21-185) into Eq. (21-182): 

2 <•“ r" 

X'(<o) = — dt cos at x"(o/) sin a't da>' 

71 0 ^0 

2 r °° r R 

= — lim dco' x!X<°') cos cot sin co't dt 

7tR-co J 0 J 0 


= - lim f dco' f{0i')\ f 

71 K- oo J 0 ^ L 


1 — cos(co' + co)R ^ l 


cos(ct/ — c o)R~ 


co f + C0 


co — CO 


Now the integrals containing the cosine terms vanish as R -> oo because the integrands 
oscillate infinitely rapidly. Therefore, we finally have 



co' dco' 
co' 2 — CO 2 


(21-186) 


which proves Eq. (21-179). The proof of Eq. (21-180) is very similar (Problem 21-23). 

Kittel* gives an interesting application of the Kramers-Kronig relations. Problem 
21-24 asks the reader to show that the Debye equations, Eqs. (21-177) and (21-178), 
satisfy the Kramers-Kronig relations. 


21-6 TIME-CORRELATION FUNCTION FORMALISM 
OF MOLECULAR SPECTROSCOPY 

We saw in the previous section that the imaginary part of the dielectric constant 
was directly related to the absorption of electromagnetic radiation. This is what 
provides a way for us to investigate the shape of spectral lines in terms of the dipole 
moment correlation function. As indicated in the first few sections, this approach was 
developed by Gordon and has been applied to infrared absorption, Raman scatter- 
ing, fluorescence, nuclear magnetic resonance, and many other phenomena.*!* We shall 
study some of these applications in this section within the more general formalism 
developed in Section 21-4. 

According to Eqs. (21-163) and (21-167), the imaginary part of the dielectric 
constant is given by 

r °0 

e"(co) = 4tc 4>(t) sin cot dt (21 -1 87) 

J o 

* Statistical Physics, New York: Wiley, 1958. 

t See, e.g., R. G. Gordon,/. Chem. Phys., 42, p. 3658, 1965; 43, p. 1307, 1965; 44, p. 3083, 1966; 45, p. 1643, 
1966; 45, p. 1649, 1966; Adv. Mag. Resonance , 3, p. 1, 1968. 
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Problem 21-61 shows that cp(t) is an odd function of t, and so we can write Eq. (21-187) 
in the form 

An 


e» = - - f <Kt)e~ ia, dt 

—CO 

= -y J Tr [p z (t)p z Po ~ Pz Pz(t)Po\e~ dt 


(21-188) 


where we have used Eq. (21-164) for $(t). 

This integral can be written in a more convenient form by using the relation 


P Tr[pAB(t)]e- i<ot dt = f°° Tr[pB(t)A]e~ io>t dt 

" — co * — CO 


(21-189) 


The proof of this equation goes as follows. Substituting in the expressions for p and 
B(t), we have 

f” TT[pAB(i)]e~ ho ‘ dt = y- f °° Trie-"* Ae i * ,, *Be- i * ,, *]e- ia ' dt (21-190) 

We can change the integration variables from t to t — iph by bringing the factor 
exp(— from the first to the last position in the expression in brackets (the trace is 
unaffected by this cyclic permutation), inserting exp(/tef) exp(— fitf) directly before B, 
and multiplying the front by expQSfrco). Doing all this gives 


f” Tr[e~ P *A e iJt,lh Be ~ i "‘ 

Q -CO 


dt 




= f Tt[A (21—1 91 ) 

Q ^ — co 

If we now bring A from the first to the last position, bring exp (— ft Jf) through to the 
first position (this term commutes with exp[iJg*(f — iph)jh] since commutes with 
itself), let t — iph = z, we get that 

,00 + ififi 


f Ti[pAB(t)]e~ im dt = e p *<° f Ti[pB(z)A]e~ iciz dz 

^ — co ^ — oo — ifih 

= J" Tr[pB(t)A]e~ kM dt 


(21-192) 


where we have assumed that Tr[pl?(0^]-^ 0 rapidly enough at t ± -► oo and that it is an 
analytic function of z for 0 > Im z > — ph . This equation says that a factor of 
exp(pha)) results when the order of the operators in the Fourier transform of a 
quantum-mechanical correlation function is reversed. Of course, this factor is unity in 
the classical limit, where the operators become ordinary dynamical functions, which 
commute. This is just one of many relationships among various forms of correlation 
functions. 

If we use this result on Eq. (21-188), we have 


e» = 


27t(l — e 


f <p z p z (t)>e-“* dt 

— 00 


(21-193) 
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We define an absorption lineshape /(to) by 
he"(o}) 1 <•“ 

I{(0) ~ 4^(1 - e -**) = 2n * ( 21 - 1 94 ) 

Using the reaction between e"(co) and a (o) given by Eq. (21-169), we see that this is 
Eq. (21-18), which we derived in Section 21-1 by starting from quantum-mechanical 
time-dependent perturbation theory. 

Note carefully that the time-dependent operator fi z (t) occurs on the right-hand side 
in the above correlation function. Such a correlation function is said to be one sided . 
Several important properties of the one-sided correlation function </z z /z z (f)> follow 
from the fact that the frequency spectrum I{co) is a real quantity, as is dear from its 
definition in Eq. (21-194). Thus letting C(t) = <^ r ^ z (0) we have 

/(to) = = -5- f dt C(t)*e‘°“ 

2n J -„ 

1 r 00 

= — dtC(-t)*e i(0t (21-195) 

2n 


where we have changed the integration variable from t to — t. Comparing Eq. (21-194) 
and Eq. (21-195), we have 

C(0 = C(-0* (21-196) 

If we separate this into its real and imaginary parts, we get 


Re C(0 = Re C( - 1) (21 -1 97) 

and 

Im C{t) = -Im C(-0 (21 -1 98) 

Thus the real part of a one-sided correlation function is an even function of time, and 
the imaginary part is an odd function of time. Since classical mechanical correlation 
functions are real functions, we find that they must be even functions of time. Quan- 
tum-mechanical correlation functions, on the other hand, are generally complex 
quantities and have both even and odd parts as a function of time. The imaginary part, 
being an odd function of time according to Eq. (21-198), represents a strictly quantum- 
mechanical contribution, which becomes negligible as the temperature increases. A 
classical correlation function, being real and even in time, leads to a lineshape function 
/( o) that is even in co, or, i.e., symmetric about cd 0 , and so the observed asymmetry in 
/( coi) can be used to determine the strictly quantum-mechanical part of C(t). In practice, 
for most molecular gases at most temperatures, classical mechanics provides a suffi- 
ciently good approximation to C(t). 

Even though a calculation of time-correlation function for all times is a difficult 
dynamical calculation, it is possible to write C(t) as a power series expansion about the 
initial time. By Taylor’s theorem, the expansion is 


<A(0)A(t)} = £ £ 
n=o n! 


d" 

df 


</i(0M(0> 


(21-199) 


t=o 
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The time derivatives may be evaluated by a repeated application of Heisenberg’s 
equation of motion for A(t): 


dA 

dt 


= l -W,A\ 


(21-200) 


This gives the general expansion 
00 (iff 

<2l(0)/f(0> = I <A(0), [JT, [X [tf, /f(0)]. . .]]]> (21-201 ) 

n= 0 n M 

in which the commutator is repeated until the Hamiltonian operator appears n times in 
the /zth term. 

An important point to notice here is that the coefficients of f involve no dynamical 
calculation; they are simply averages of equilibrium quantities. This can be readily 
seen by considering the classical limit of Eq. (21-199). In that case, we have 


<A(0)A(t)> 


= ^( 0 ) 


/1(0) + M(0) + -J‘(0) + 


= (A(O)A(O)) + t<A(0)A(0)> + - <A(0)A(0)> + 


( 21 - 202 ) 


The coefficient of t 2 can be written in a different form by using the relation 


<A(0)A(0)) = <2((0)/l(0>] ( ^ 




-[|o.(-.)Ao)>]_ o 

= -<X(0)^(0)> 


( 21 - 203 ) 


In addition, we can show that the coefficient of t vanishes: 

</)(0)/i(0)> =\{^ t ^ = 0 ( 21 - 204 ) 

since the rate of change of a quantity must be zero at equilibrium. Equation (21-202) 
then becomes 

<A(0)A(t)> = (A(0)A(0)) - £ (A(0)A(0)) + ■■■ (21 - 205 ) 


Notice that this result is consistent with the fact that classical time-correlation functions 
must be even in time. Furthermore, notice from Eq. (21-201) that the even powers of t 
are real and that it is the odd powers of t that are imaginary. Not only do the odd 
powers of t not vanish in the quantum-mechanical case, but these terms are responsible 
for the imaginary part of a one-sided quantum-mechanical time-correlation function. 

The coefficients in the time power series may be given a physical identification as 
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moments of the appropriate frequency spectrum. We can invert Eq. (21-194) and 
expand the exponential in the integrand, giving 

<^(0> = r 

^ — ro 



(21-206) 


Comparing this with Eq. (21-201), we see that the moments of I(co) may be calculated 
by an equilibrium average. Often the first few moments give a satisfactory para- 
metrization of the lineshape. 

Perhaps we should say a word at this point about the limits of co in Eq. (21-206). We 
ordinarily do not think of a frequency as taking on negative values, but it is often 
very useful to do so. For example, if /( o) represents the lineshape of some particular 
vibrational absorption band, it is convenient to take the zero of frequency at the band 
peak. In Fig. 21-10, which shows the profile of a particular vibrational absorption of 
CH 4 broadened by He, we would consider the curves to be centered about <o = 0. 
Similarly, in Fig. 21-9, which shows the far infrared absorption spectrum of an argon- 
neon mixture, we would extend the curve symmetrically through co = 0. This is strictly 
a matter of convenience, however. 

Let us now return to Eq. (21-194): 


/ ( £0 ) = ^t /_ dt 


(21-207) 


We can derive approximate lineshapes by assuming simple, physical approximations 
for </* z /* z (0>- We shall consider only the classical limit; so our approximations will be 
real, and hence 1(d) will be a symmetric function. We treat three examples. 


COLLISIONLESS MICROWAVE ABSORPTION 

Consider a collisionless gas of permanent dipolar molecules. Since there are no 
collisions, the molecules simply rotate undisturbed with some angular frequency o 0 , 
and the dipole moment time-correlation function can be written as 


0* z Mz(0> = Hoz 2 COS <D 0 t (21 -208) 

If we substitute this into Eq. (21-207), we get 
u 2 

1(d) = Jy- [8(0) 0 - d) + S( (O 0 + d)] (21 -209) 


This result for I(o) says that the system will absorb radiation only at o = co 0 and, in 
fact, all the incident radiation at that frequency. The lineshape for this system, then, is 
an infinitely narrow line at co = o 0 . This is the correct classical electromagnetic 
result. Of course, the molecules in a gas experience collisions, and this has the effect of 
destroying the long-time correlation in Eq. (21-208). 


PRESSURE BROADENING OF MICROWAVE LINES 

In order to include the effect of molecular collisions in the correlation function, we 
say that the probability that a molecule will not experience a collision by the time t is 
given by e -t/T , where t represents the mean time between collisions. We can write a 
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simple approximation for </z 2 (0)/i z (/)> by using this factor to modulate the cos a> 0 1 in 
Eq. (21-208) to write 

<A*zA*z(0> = Voz 2 e~ Wl COS co 0 1 (21-210) 

The lineshape corresponding to this correlation function is 

1 

-7 + (®o - G>) 2 
T 

u o- 2 r i/t ~i _ 

« for to >0 (21-211) 

47t 1 

7 + ( c °- «°o) 

T 


T 


+ (C0 0 + (o) : 


Note that the term in brackets here becomes n5(o — a> 0 ) as t -> oo (Problem 21-25), 
and hence this expression for /( co) reduces to the collisionless case as the mean time 
between collisions approaches infinity. Equation (21-211) for I(co) is known as a 
Lorentzian lineshape and is shown in Fig. 21-8, for several values of 1 /t. It can be seen 
there that the effect of collisions is to broaden the delta function of Eq. (21-209). A 
Lorentzian curve does not have finite moments beyond the first. This is because the 
exp(— [ 1 1 /t) in Eq. (21-210) is not analytic at t = 0, and hence </* z /i z (t)> does not 
possess a Taylor expansion in time about t = 0. It is customary to express the width of 
a Lorentzian curve as the width at half the maximum value. The width, then, is 2 /t. 
The mean time between collisions is approximately the mean free path divided by the 
average velocity, and so is 0(10“ 10 sec) at standard temperature and pressure. Con- 
sequently, pressure broadening of microwaves lines is expected to be of the order of less 
than 1 cm -1 . This is in accord with experimental results. Of course, the correlation 
function we have used here is just an approximation, but it does seem to capture 



Figure 21-8. Lorentzian line shape. 
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much of the “ physics ** of the problem. There is an entire research field of pressure 
broadening, whose primary goal, in a sense, is to calculate the broadening of spectral 
lines in terms of the intermolecular interaction between the colliding molecules. 


COLLISION-INDUCED ABSORPTION 

You learn in physical chemistry that a molecule must possess a permanent dipole 
moment in order to absorb radiation in the microwave or far infrared region. Although 
this is true for an isolated molecule, it is not entirely correct for a collection of mole- 
cules. Equation (21-194) simply requires that there be a dipole moment time-correlation 
function, and this does not require a permanent dipole moment. To see this, consider 
a gaseous mixture of two rare gases, say neon and argon. As a neon and argon atom 
collide, they form a transient heteronuclear diatomic molecule, and so possess a 
transient dipole moment. This dipole moment lasts only during the collision, and its 
correlation function might be approximated by 

<Hz Mz(0> = Vo*-' 11 ' 1 (21-212) 


where now t is the duration of a collision. Since this correlation function is due only 
to collisions, the absorption associated with this process is called collision-induced 
absorption. It is possible to observe collision-induced absorption. The spectrum 
associated with Eq. (21-212) is 


m 


Vo T _-cd 2 t 2 /4 

2tz 1/2 


(21-21 3) 


This equation predicts that the spectrum is Gaussian, with a variance given by 2/t 2 or a 
standard variation of 2 1/2 /t. The mean duration of a collision is the range of the poten- 
tial divided by the average speed, or 0(10" 13 sec). This predicts that the absorption 
predicted by Eq. (21-213) should be several hundred wave numbers wide. The experi- 
mental data for neon-argon absorption are shown in Fig. 21-9. Although Eq. (21-213) 
is approximate, it nevertheless is probably a fair representation of the true correlation 


4.5 X I0" 8 



Figure 21-9. The line shape of the neon-argon far Infrared collision-induced absorption spectrum. (From 
D. R. Bosomworth and H. P. Gush, Can. J. Phys., 43, pp. 751-759, 1965. Reproduced 
by permission of the National Research Council of Canada.) 
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function. From Eqs. (21-206) and (21-199) one can show that the nth moment of 
7(co) is given by 

oo 

CD n I(co) d(D = (— 0"<n(0) * |X ( " ) (0)> (21 —21 4) 

— oo 

where ]i (n \0) represents the nth time derivative of p(l) evaluated att = 0. The important 
point here is that the above average is an equilibrium ensemble average and does not 
involve any time dependence. It is straightforward to show that in the classical limit 
(Problem 21-62) that 

<p(0) • ji (w) (0) = 0 n odd 

= <p (n/2) (0) • p (n/2) (0)> n even (21 -21 5) 

The p ( " /2) can be determined from a repeated application of the equation of motion of 
p(f) in the Poisson bracket notation 

du 

~±=(}L,H) (21-216) 


This approach has been used by Brenner and McQuarrie* to derive a lineshape theory 
of collision-induced absorption in rare gas mixtures. 




Figure 21-10. (a) The effect of various densities of helium on the v 3 band of CH* . The individual profiles 
have been normalized to show equal areas; the relative intensity scales have been displaced 
for clarity of presentation, (b) The real and imaginary parts of the correlation function as 
obtained from a CH*-He v 3 band profile at 436 amagats. (From R. L. Armstrong, 
S. M. Blumenfeld, and C. G. Gray, Can, J. Phys. 9 46, pp. 1331-1340, 1968. Reproduced 
by permission of the National Council of Canada.) 


S. L. Brenner and D. A. McQuarrie, Can. J. Phys. 9 49, p. 837, 1971. 
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So far we have calculated the lineshapes of certain processes by approximating the 
time-correlation function by simple functions. As we saw in Section 21-1, it is also 
possible to invert experimental data and determine an experimental time-correlation 
function. This is shown in Fig. 21-1. Another example is shown in Fig. 21-10. 

Figure 21-10(a) shows the effect of various densities of helium on the v 3 band of 
methane. Since a classical correlation function gives a symmetric lineshape function, 
the asymmetry in these bands allows the imaginary part of the correlation function to 
be determined also. Both the real and imaginary parts of C(t) are shown in Fig. 21-10 
(b). Note that the imaginary part is, in fact, much smaller than the real part. This is in 
agreement with the approximating C(t) by a classical correlation function. 

21-7 DERIVATION OF THE BASIC FORMULAS 
FROM THE LIOUVILLE EQUATION 

In this section we shall derive our previous results by starting from the Liouville 
equation. Although we shall not derive any new equations, the method is often used 
and also introduces several useful new concepts. The first half of this section will deal 
with the classical mechanical derivation, and the second half will treat the quantum- 
mechanical one. 

Consider a classical system in equilibrium. At t = 0 we apply a force F(t). We assume 
that the force may be represented by an additional term Jf ext of the Hamiltonian, so 
that we may write 

2te = 3^0 + -^ext = - AF(t) (21 -21 7) 

where A = A(p , q) is a function defined in phase space. Let the response be denoted by 
B{p, q); then according to Eq. (21-1 19), the average of B(t) is related to F(t) by 

<B(0> = (df tft - t')F(t') (21-21 8) 

J o 

We wish to find the response function <j)(t) or the susceptibility xiai) in terms of the 
phase space functions A and B. To do this, we simply must determine the phase 
space distribution function f(p, q, t) to terms linear in F(t) and then use it to determine 
<B(0> according to the standard formula for the evaluation of a phase space average, 
namely, 

<£(*)> = J dp dq B(p, q)f(p, q, t) (21 -21 9) 

Assuming that the external field is small, we can write the distribution function as an 
unperturbed part plus a small perturbed part 

/=/ 0 + A/ (21-220) 

By substituting this and Eq. (21-217) into the Liouville equation and jusing 

|^=(^o./o) (21-221) 

we get 

^ = (*o , A/) - F(t)(A,f 0 ) (21-222) 

to terms in first order of the perturbation. 
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It is convenient to introduce the Liouville operator, 

= 4 f ) (21-223) 

where 4 / here is just an arbitrary phase space function on which operates. In terms of 
this Liouville operator, the Liouville equation is 

j; = -i&f (21-224) 

whose formal solution is 

f(P, q, 0 = e~ ise, Ap, q, 0 ) (21 -225) 

The equation of motion of any dynamical quantity a (p, q), not depending explicitly 
on time, can be written in terms of the Poisson brackets or the Liouville operator: 

da 

— = -(jr, a) = iS£ a (21-226) 


The formal solution to this equation is 

a(t)=e u * a(0) (21-227) 

This is to be compared to the formal solution of the Liouville equation, Eq. (21-225). 

The expectation value of a (/?, q\ weighted according to the ensemble density 
/0> < 1 > 0. is 

<« I /( 0> = J \ dp dq a{p, q)f(p, q, t) 

= (a\e~ i,se f(0)} (21-228) 

= <e“\x|/(0)> 

= <a(0|/(0)> (21-229) 

In going from the second to the third line above, we have used the fact that exp (±it&) 
is unitary. This follows from the fact that we have defined & such that it is Hermitian, 
i.e., such that 

J J q)&h(p, q) dp dq = j J (&g)*h(p, q) dp dq (21 - 230 ) 

for functions g(p , q) and h(p> q) that vanish sufficiently rapidly at infinity in momentum 
space and outside of some volume in coordinate space. We shall use the fact that 
exp(±/t-Sf) is unitary below. Equations (21-228) and (21-229) show that expectation 
value of a at time t can be calculated in two ways: by following the evolution of the 
ensemble density or the evolution of the dynamical function. 

In terms of the Liouville operator, Eq. (21-222) becomes 

^ = - i& 0 A/- F(t)(A,f 0 ) (21-231 ) 

This equation can be solved formally as a linear first-order differential equation to 
give (Problem 21-33) 

A/(t) = - fV-^,/ 0 )F(0 df (21-232) 

J o 

We have assumed that Af = 0 at t = 0, i.e., that the system is in equilibrium at t = 0. 
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The expectation value of B(p , q) is given by 

<*!/«> = <*|/o> + <B\ A/(0> (21-233) 

Without loss of generality, we may assume that the expectation value of B is zero for 
the system before the external field is turned on. Then we can write 

= <B\ A/(t)> 

= dp dq B{p ' 

= - J* dt' J jdp dq[e-«--"*°B] [A(t%Ut')]F(t') 

- - £ dt’F(t') J \dpdq [AinUOMt -t') (21-234) 

In going from the first integral to the second, we have used the unitary property of 
exp (i&t), and to write the last line we have used the fact that e iset B(G) = B(t) according 
to Eq. (21-227). This may be written as 

<£(*)> = ( dt’ - f) (21 -235) 

J o 

where 

4>(t) =f jdpdq[f 0 ,AmB(t) 

= J jdpdqf o [A(0),B(t)] 

= <M(DX H(0]> (21-236) 

The second line follows from the first by an integration by parts. 

If the distribution / 0 is assumed to be canonical, 

f 0 = e -fi*o/j J e ~ f *° dp dq 

then Eq. (21-226) can be used to show that (Problem 21-21) 


(fo,A)=PAf 0 (21-237) 

If we substitute this into Eq. (21-236), we get another form for 4>(t) y namely, 

<K0 = P<A(0)B(t)> (21-238) 

Furthermore, since the angular brackets denote an equilibrium ensemble average, we 
have the stationarity condition 

</4(0)£(t)> = <A(s)£(t + s)> (21 -239) 

which gives 

— (A(s)B(t + s)y — (A(s)B(t + s)y + (A(s)B(t + s)y 
as 

= </i(0)£(0> + (A(0)B(t)y = 0 (21 -240) 

We see then that another form for 4>(t) is 

= -p(A(0)B(t)y (21-241 ) 
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We had presented most of these expressions for 4>(t) in Section 21-4, but here they 
were derived directly from the classical Liouville equation. This is an important 
exercise in itself since the Liouville is the most fundamental equation of statistical 
mechanics, and many statistical mechanical transport theories start with this equation. 

We shall now derive the corresponding quantum-mechanical time-correlation 
function expressions. In making the correspondence between classical mechanical and 
quantum-mechanical expressions, the distribution function is replaced by the density 
matrix p, the Poisson bracket by a commutator, and the integration over phase space 
by the trace operation. The quantum-mechanical Liouville equation is (Problem 
21-34) 


ur. 


(21—242) 


where the square brackets denote the commutator of and p, i.e., 3tfp — pJF, and 
we have introduced the quantum-mechanical Liouville operator 


&p = { -W, p]=U*p-p*] 

n n 


(21-243) 


The quantum-mechanical analog of the classical equation of motion is the Heisen- 
berg equation of motion. In the quantum-mechanical case, the a (p, q) of Eq. (21-226) 
is an operator, whose time evolution is given by 


dcL 1 

— = - [Jf , a] = i&* 
dt ih 

The formal solutions of Eqs. (21-242) and (21-244) are 
p(t)=e- i '*p(0) 
a (t) = e use a(0) 


(21-244) 


(21-245) 

(21-246) 


The more conventional form of these equations is the Heisenberg representation for 
the time dependence of a quantum-mechanical operator: 


p(t) = e- , - r ' / V(0y jr ' / * 
oc(0 = 

The expectation value of a is given by 
<a|p(t)>=TrM0] 

with analogous forms for Eq. (21-229). (See Problem 21-35.) 
The equation for A p is 


(21-247) 

(21-248) 


(21-249) 


= -iSe 0 Ap - — [A, p o JF(0 


(21-250) 


whose solution is 


Ap = - f V 

J n 


- ,)se ° PoMO dt' 


(21-251) 



DERIVATION OF THE BASIC FORMULAS FROM THE UOUVILLE EQUATION 


511 


The expectation of B is 

<£(0> = <B|Ap> = Tr[BAp(0] 

= ~kS 0 ^ F(t ) TT t Be ‘ ir - ,) * 0 l A > />oD 

= fdt’ F(t') Tr {B(t - p 0 (t')]} (21 -252) 

in 

The response function is 

4>(t) = 4 Tr{[p 0 , A]B(t)} (21 -253) 

in 

Thus we have derived Eq. (21-158) by starting with the quantum-mechanical Liouville 
equation. By using the fact that the trace of a product is invariant with respect to a 
cyclic permutation, i.e., that (Problem 21-18) Tt(ABC) =Tt(CAB) = Tt(BCA ), it is 
easy to show that 

4>(t) = 4 Tr ^oU> (21-254) 

in 

which was derived before [cf. Eq. (21-159)]. 

Lastly, we can derive Eq. (21-160) by assuming that p is canonical, i.e., that 
p 0 = e~ p *°ITr e~ p *°, and making use of the identity 

[Po ,A]=( p 0 e**°[A, Jfr 0 ]e-**° dX 
J o 

r p 

= ih I p 0 A(—ihX)dX 

= ihPp 0 A (21-255) 

where as before we have defined the Kubo transform of A by 
~ 1 / . 

A = -\ A(-ifiX)dX 
P J o 

1 r p 

= - J e^°Ae~ x ^ dX (21-256) 

Equation (21-253) for <j>(t) can then be written as 

4>(t) = P Tr[p 0 BA(t)\ (21-257) 

which is Eq. (21-160). 

We are now in a position to summarize the results that we have derived so far. For 
an external force F(t) that adds a term — AF(t) to the Hamiltonian of the system, the 
response B(t) is given by 

B(t) = ( W - t')F{f) dt' 

J o 


( 21 - 258 ) 
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where we have the following correspondences between the various expressions for the 
response function <j>(t): 


classical 

J jdpdq[f 0 ,A(0)]B(t) 

J fdpdqf o [A(0),B(t)] 
pj j dp dqf 0 A(G)B(t) 


quantum 

^Tr{[Po,A](Bt)} 

i- Tr{p 0 [A, B(t)]} 
ih 

P Tr{p 0 AB(t)} 


We have explicitly subscripted the ensemble distribution functions in these expressions 
to emphasize that they are equilibrium distribution functions. 


21-8 TIME-CORRELATION FUNCTION EXPRESSIONS 
FOR THE THERMAL TRANSPORT COEFFICIENTS 

In the previous sections we have presented several derivations of the basic formulas 
expressing a susceptibility x(co) as the Fourier-Laplace transform of some appropriate 
time-correlation function. Since all of these alternative derivations are exact to within 
certain general assumptions (such as linear response), they must all be rigorously 
equivalent to one another, which, in fact, they are. They are all based upon the fact 
that the externally applied field adds a perturbation term to the Hamiltonian of the 
system. For the applications we have discussed so far, this perturbation term arises 
naturally since it is of a mechanical or electromagnetic nature, but what if we wish to 
discuss thermal transport phenomena such as diffusion, viscosity, and thermal con- 
ductivity? There is no simple or obvious way that a concentration gradient or a 
temperature gradient can be incorporated into a well-defined perturbation Hamiltonian 
since these are of a thermal or nonmechanical nature, and so the previous derivations 
do not lend themselves to these processes in any straightforward manner. Neverthe- 
less, it is possible to derive expressions for the standard transport coefficients such as 
the coefficient of self-diffusion D , the viscosity 17 , and the thermal conductivity X in 
terms of integrals over certain time-correlation functions. Because of the lack of a 
well-defined mechanical perturbation Hamiltonian, however, these derivations are more 
subtle than those in the previous sections. The reason for this is that a number of 
assumptions must be made in the derivations, and it is often difficult to assess the 
quantitative limitations of the resulting equations. The first such approach was 
presented by Green in his important pioneering work in the 1950s.* His work contains 
a number of approximations and assumptions, principally that the dynamical process 
underlying the transport process of interest is a Markov process, and the departure 
from thermal equilibrium is small. His derivation is expressed in terms of a type of 
Fokker-Planck equation in a space of certain macroscopic variables (called “gross 
variables ” by Green). He was the first to obtain expressions involving time-correlation 
functions for the coefficients of shear and bulk viscosity, thermal conductivity, diffu- 
sion, and thermal diffusion. 

Following Green’s work, a number of alternative derivations of these basic equations 
appeared in the 1950s and 1960s. Some of these approaches differed greatly from one 

* M. S. Green,/. Chem. Phys, 20, p. 1281, 1952; 22, p. 398, 1954. 
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another, although they all gave essentially the same final result. In Zwanzig’s 1965 
review article,* he discusses six different derivations of the equations for the thermal 
transport coefficients! This was a very active area of research in the 1960s, and the 
limitations and generalizations of these equations became fairly well understood by the 
end of that decade. This work involves more advanced concepts and techniques than 
the level of this introductory chapter, but in the next chapter we shall derive exact 
equations that are somewhat complicated and abstract but which reduce to the simpler 
equations to be derived below when appropriate well-defined limits are taken. These 
more rigorous derivations show that the simpler equations are exact in the limit of 
small disturbances and long times, i.e., in the hydrodynamic limit. 

We wish to present a more elementary and heuristic derivation, however, in this 
section in order to introduce and discuss the basic time-correlation function equations 
in the hydrodynamic limit. Out of all the possibilities, we have chosen the method 
presented by Helfand in 1961f since it is not only pedagogically one of the most 
appropriate but the one we shall generalize in Section 22-5. 

As an introduction to Helfand’s approach, we shall consider the case of self- 
diffusion. We start with the diffusion equation 

gG k ^ = DV 2 G( r, t) (21-259) 

with the initial condition 

G( r, 0) = <5( r) (21-260) 

This is the macroscopic equation governing the probability G(r, t) dr that a particle is 
located in the volume element dr at r at time t given that it was initially located at the 
origin. We are going to consider only the case of isotropic diffusion, so that G(r, t) = 
G(r, t ), but we shall continue to write G(r, t ) to remind ourselves that G is a three- 
dimensional function. Sinje the time-correlation function expression for D that we are 
in the process of deriving is based upon the validity of this equation, the formula 
itself must be subject to the same limitations, namely, that spatial inhomogeneities 
must be smoothly varying and the time must be long compared to microscopic times 
(cf. Section 20-1). 

If we write the Fourier transform of G(r, t) as 

/• s (k, t)=j dr e ,h ' r G( r, t) (21-261 ) 

then clearly 

F£ k, 0) = 1 (21-262) 

It is easy to show by taking the Fourier transform of both sides of Eq. (21-259) that 
(Problem 21-36) 

’ * - = —k 2 DF£k, t) (21-263) 

ot 

whose solution is 

F£ k, t) = exp(— /c 2 Z)0 (21-264) 

* R. Zwanzig, Ann, Rev. Phys. Chem. , 16, p. 67, 1965. 
t E. Helfand, Phys. Fluids, 4, p. 681, 1961. 
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According to this equation for F£ k, t\ 



(21-265) 


We can derive another expression for this derivative by differentiating Eq. (21-261) 
directly, giving 




= - Kr\t)> 


(21-266) 


giving the average square displacement as a function of time. Thus we have that 

<r 2 (0> = 6 Dt (21-267) 

an equation that we derived in Chapter 20 [cf. Eq. (20-19)]. This equation, in fact, 
was first derived by Einstein in his early work on Brownian motion. If the diffusing 
particle had originated at the point r 0 instead of the origin, then Eq. (21-267) would be 
simply 

<|r (0 — r 0 | 2 > =6Dt (21-268) 

It is instructive to derive Eq. (21-268) in another way. We have interpreted G( r, t) 

as the conditional probability that some particular, for example tagged, molecule is 

located in dr at r at time t given that it was at the origin at t = 0. We can also interpret 
it as the fraction of particles in dr at r at time t given that they were located somewhere 
else at t = 0. Thus 

G(r, 0 = - [r,.(0 - r/0)]}^ (21-269) 

The Fourier transform of this is 

F s (k, 0=1^ E e* (21 -270) 

Note that this is the time-correlation function of the dynamical quantity exp[*k • r(t)] 
since it can be written in the form (if the dynamical quantity is complex, the time 
correlation function is defined as <yf*(t)^(0)» 

F£ k, <) = 1 ^ E e _ik ‘ ,J<0 V k ' rj< ' ) ^ (21-271 ) 

Differentiation of Eq. (21-270) gives Eq. (21-266). 

We can now express this equation as the integral of a time-correlation function. 
Clearly 

r(0~r o = f <ft'v(0 (21-272) 

J o 

and 


[r (0 - 'o] 2 = f dt' f' dt" v (0 • v (0 

J o J o 


(21-273) 
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where v(/) is the velocity of the particle. If we ensemble average both sides of this 
equation, we get 

<[r(0 - r 0 ] 2 > = ( dt' ( if <v(/') • v(0> (21-274) 

J 0 J 0 

Due to the stationarity of the equilibrium ensemble average and the time reversibility 
of the classical mechanical equations of motion, we have that 


<v(0 • v(0> = <v(t' - *") • v(0)> = <v(r - O ‘ v(0)> (21-275) 

If we now change integration variables in Eq. (21-273) to i = f — 1\ change the 
order of integration, and carry out the one obvious integration, we get (Problem 21-37) 


<K0-ro] 2 >=6Z» = 2t 



<v(0) • v(t)> dx 


(21-276) 


If the time-correlation function <v(0) • v(t)> decays sufficiently rapidly and t is chosen 
sufficiently large, then one can neglect the z jt in the integrand and write 

6 Dt —2 1 [ <v( 0) • v(t)> dz t -+ oo (21-277) 

J o 

Since it is assumed that <v(0) • v(t)> has fallen to zero long before t, this integral is 
insensitive to the value of t for large values of t and so it is permissible to write finally 



f <v(0) * v(t)> dz 
J o 


(21-278) 


which is our desired expression for D . It should be clear from the derivation that this 
equation is limited to processes for which Eq. (21-259) is a suitable description, which 
means for “normal” diffusion processes, i.e., ones that occur slowly with respect to 
microscopic times and for which spatial variations are smooth. 

Equation (21-278) is often written in terms of the normalized velocity autocorrela- 
tion, 


<v(0) • v(t)> 
<v(0) • v(0)> 


(21-279) 


Using the fact that <v(0) • v(0)> = 3kTjm y we have 



(21-280) 


Clearly it is no simple matter to calculate C(t) exactly, but several molecular dynamics 
calculations of C(t) have been made. Figure 21-1 1(a) shows C(t) for a system of hard 
spheres for several densities, and Fig. 21-1 1(b) shows C(x) for a system of Lennard- 
Jones molecules. The negative region that occurs at high densities was first observed by 
Alder and Wainwright and is interpreted as the back scattering that occurs when a 
molecule collides with one of the neighboring molecules forming a cage around the 
central one. We shall return to Eq. (21-278) for D after we derive the corresponding 
expressions for the other transport coefficients. 

The calculation of the other coefficients proceeds in a similar manner. In order to 
treat the shear viscosity, one starts with the Navier-Stokes equation with no external 
field [Eq. (17-73)]. 





Figure 21-11. Velocity autocorrelation functions calculated from molecular dynamics, (a) Calculated 
on a hard-sphere system for various densities, indicated by the values of v/v 0 . The abscissa 
is given in multiples of the mean collision time. (From B. J. Alder, D. M. Gass, and 
T. E. Wainwright, J. Chem. Phys ., 53, p. 3813, 1970.) (b) Calculated on a system of 
Lennard-Jones molecules at a reduced density of p* = pa 3 = 0.85. The solid line is for 
a reduced temperature of 4.70, and the dashed line is for a reduced temperature of 0.76. 
As in (a), the abscissa is given in units of mean collision times. (From D. Levesque and 
L. Verlet, Phys. Rev., A2, p. 2514, 1970.) 


where p m is the mass density mp . The stress tensor has the form 

a = — [p 4- (|*7 — jc)V • u]l + 2*7 sym(Vu) (21-282) 

where rj and k are the shear and bulk viscosities, respectively. We shall work with the 
linearized form of the Navier-Stokes equation in which u and dpjdt are small, so that 
we have (Problem 21-38) 

du 

mp = — Vp — (f *7 — *c)V • (V- u)l 4- 2*7 V • (sym Vu) (21-283) 

It is a straightforward exercise to show that (Problem 21-39) 

V ■ (sym V • u) = iV 2 u + iV(V • u) (21-284) 

and 

V • (V • u)l = V(V ■ u) (21-285) 

and so we get the linearized Navier-Stokes equation (Problem 21-40) 

du - 

mp — = Vp + * 7 V 2 u + Q *7 + k)V(V ■ u) (21-286) 

ot 


This equation plays the same role as the diffusion equation does in the calculation of 
the self-diffusion coefficient. 
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If we now take the Fourier transform of this equation and use the following relations 
(Problem 21-41) 

JJJ dr e ik ‘ r Vp = ik/?(k, t) 

JJJ dr e ik ' r V 2 u = — A; 2 J(k, t) 

JJJ dr e ik ' r V(V ■ u) = -k[k * J(k, t)] (21-287) 

where p(k, t) and J(k, t) are the Fourier transforms of the pressure and velocity, 
respectively, we get 

mp = — /k/?(k, t) — rjk 2 J(k, t) — (jrj + k) k(k • J(k, t)] (21-288) 


The scalar product in the last term here suggests that we separate J(k, t) into three 
orthogonal components, /||(k, t), J ±1 ( k, t), and / ±2 (k, t), which are the components of 
J(k, t) parallel and perpendicular to k. Equation (21-288) becomes 

mp ^ = - ikpQt, 0 - (f r\ + K)k 2 J |,(k, I) (21 -289) 


mp ^ = -fcViaQt. 0 a = 1, 2 


(21-290) 


The second of these equations can be solved immediately to give 
/ lo (k, 0 = J ± Jk, 0)e~ k2 ’ l,,mp (21-291 ) 


This equation corresponds to Eq. (21-264) in the derivation of D. It is convenient at 
this point to depart slightly from the procedure used to derive the self-diffusion coeffi- 
cient and to form the time-correlation function of / lct (k, t) by multiplying it by 
/J a (k, 0) =/j. a (— k, 0) and averaging over the equilibrium ensemble. Using Eq. 
(21-291) for J ±ct (k, t), we get 


< jjjfr oy a o> 

< JIA 0)J la (k, 0)> 



(21-292) 


The velocity u(r, t) can be written as 

u(r, 0 = Z y j “ r /01 (21-293) 

j= i 

where y,- is the velocity and r j the position of the y'th particle. It follows from this that 

J(k, t) = (21-294) 

j= i 

If we choose our coordinate system so that k points in the z-direction, then 

^IlCk, t) = £ z/0e ttzj(O (21— 295a) 

j= 1 

/xi(k, <)= Z xj(t)e> kz ' w (21— 295b) 

j= 1 

A 2 (k, t)=Y yj(ty kzjit) 

7 = 1 


(21 -295c) 
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Note from these last two lines that if the fluid is isotropic, then C ±1 (k, t) = C 12 (k, 0 
and we can drop the a subscripts in Eq. (21-292). By following the same procedure as 
before, we can find a time-correlation function expression for r\ by first noting that it 
follows from Eq. (21-292) that 



(21-296) 


Using Eq. (21-295b) for J L ( k, t) in Eq. (21-292) gives 

| ^ E I x,(0)jc/0)e «t*.(°)-MO)i 

(21-297) 

Instead of differentiating this equation according to Eq. (21-296), it is simpler to 
expand the exponential functions in the two ensemble averages in powers of k and 
then compare the coefficients of k 2 on the two sides of the equation. This is equivalent 
to using Eq. (21-296). 

We first note that since the velocities and coordinates are independent in an equilib- 
rium ensemble, the ensemble average on the right-hand side of Eq. (21-297) becomes 



S u e 


»fc[ Zl (0)- Zj (0)] 


> 


Nk B T 

m 


(21-298) 


We have subscripted the Boltzmann constant with B in this equation to avoid confu- 
sion between it and the wave vector appearing in the exponential function. 

The evaluation of the ensemble average on the left-hand side of Eq. (21-297) 
requires some amount of manipulation. Expanding this ensemble average in powers of 
k gives 



E x,(0x/0)^ + ifc^E E^i(0x/O)[z ( (<) - Zj( 0 )]j 

- 7 ( ( E E*.(0*;(0)[z,(0 - z/0)] 2 ^ 


+ 


(21-299) 


The first term can be evaluated by noting that because of conservation of momentum, 


E m = E m 

i=i i=i 


(21-300) 


Therefore the first term becomes simply 

/ N N \ / N N \ KTU T 

( E E *|(0*;(0) ) = ( E E x,(0)x/0) ) = — (21-301 ) 

\i=ij=i / \i= i j= i / m 

To evaluate the second term in Eq. (21-299), we write it as the difference between two 
terms. For the first of these, write (Problem 21-60) 

(e E * i (0*/o)z j (o') =(e E *i(0*,<0zi(0 

V=i j=i / \i=ii=i 

N N \ 

E E ^(oy/oyto)) (21-302) 

1=1 j=l / 
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The first step follows from conservation of momentum, and the second follows from 
the stationarity condition for an equilibrium ensemble average. By applying the same 
argument to the other part of the second term in Eq. (21-299), we see that we get the 
same result, and so the coefficient of k in Eq. (21-299) is equal to zero. 

Putting all of these results in Eq. (21-297) gives then 


Nk n T k 2 


m 


Z*.<0*/o)[z,(>) - z/o)] 2 ^ 


+ 


Nk„T 


m 


\ pm J 


from which we find that the coefficient of shear viscosity is 


N N 


n = 


2Vk B Tt \i=i j^i 


I I pJt)P*0Xzft) 


- z/0)] 2 ^ 


(21-303) 

(21-304) 


where we have written mxj as p xj . This almost corresponds to Eq. (21-268). In order 
to convert this into an integral over a time correlation as we did in going from 
Eq. (21-268) to Eq. (21-276), it is necessary to write this in the equivalent form 
(Problem 21-42) 

9 = 2 Vk B Tt (yf - z /0)P*/0)] 2 ^ (21 -305) 


The procedure in going from Eq. (21-304) to Eq. (21-305) involves the same type of 
manipulations as in Eqs. (21-299) and (21-302). (See Problem 21-60.) Note that 
Eq. (21-305) is in exactly the same form as Eq. (21-268), namely, the ensemble 
average of [A(t) — >4(0)] 2 , where A(t) is r /t) in Eq. (21-268) and Zj(t)p xj (t) in the above 
equation. 

By differentiating ZjP xJ with respect to time, we see (by working backward) that 
## - z /0)p x ,(0) = jV{ [ P2j( ^ <0 ] + (21-306) 


where F jx (t') is the x-component of the intermodular forces acting upon particle j at 
time t'. By squaring this, taking the ensemble average, and changing variables of 
integration, we finally get the desired result (Problem 21-43) 




(21-307) 


where 




It is possible to symmetrize the second summation here and write J as 

J = m Y*jZj+\ Z Z ij F ij X 
j - 1 


(21-308) 


(21-309) 


where z lV = z t — z y and F^* = F* — Fj*. 

The pattern for deriving expressions for the thermal transport coefficients in terms 
of integrals of time-correlation functions should be clear. We first select the appro- 
priate macroscopic transport equation and solve it in Fourier transform space to de- 
rive an equation for the Fourier transform or its correlation function in terms of the 
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macroscopic transport coefficients [cf. Eqs. (21-264) and (21-292)]. We then identify 
this same Fourier transform in terms of microscopic variables such as the positions 
and momenta of the molecules of the system [cf. Eqs. (21-270) and (21-295)]. Lastly 
we expand both the macroscopic and microscopic representations of the Fourier 
transform (or its correlation function) in powers of the Fourier transform variable k 
and equate the coefficients of k 2 9 thus giving the macroscopic coefficient in terms of an 
ensemble average of microscopic variables. This is then manipulated into the form 


y = - m) 2 > 


(21-310) 


where y is the macroscopic transport coefficient and A(t) is some function of micro- 
scopic dynamical variables. Lastly, A(t) is written as the integral of dA/dt, and Eq. 
(21-310) is transformed into the desired form 


= fdt <A(t)Am 


(21-311) 


We shall do this briefly for one final example, the thermal conductivity. 
We start with the heat conduction equation, Eq. (17-31), 

pC v ^ = XV 2 E 


(21-312) 


where we have written E instead of T as the dependent variable. We consider E( r, t) 
here to be the excess above the average energy. If we let L(k, t) be the Fourier trans- 
form of E(t , /), then the above equation yields 

L(k, t) = L(k, 0)e~ Xk2tlpCv (21 -31 3) 

Now the microscopic representation of £(r, t) is 


£(r,0 = I£ y <5(r-r,) 

j=l 

where 

Ej=Ej-<Ej> 

The Fourier transform of this gives 


(21-31 4) 


(21-315) 


L(k,o= E V k ' r ' (0 

/= l 

Without loss of generality, we can take k in the z-direction, and so 


(21-316) 


L(k, t) = £ Eje ikz ^ 
j= i 


(21-317) 


By multiplying both sides of Eq. (21-313) by L*(k, 0) and taking the ensemble average, 
we get 

(i 


= (t | Ej(0) W‘ [ZI(0) - Z ' (0)] ) exp(-^p) 


(21-318) 


This equation is very similar to Eq. (21-297). The procedure from here is essentially 
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the same as that following Eq. (21-297). To evaluate the leading term in either en- 
semble average, we use the fact that 

= <( £ - < £ » 2 > (21-31 9) 

which according to fluctuation theory satisfies [cf. Eq. (3-42)] 

(.(E — <E» 2 > = k B T 2 C„ (21-320) 

Thus it should be easy to see that we eventually can get (Problem 21-44) 

A = 2Vk B T 2 t (.1, EWO " z,(0)] 2 £,<0) £ /0^ (21-321 ) 

This equation closely resembles Eq. (21-304). Using conservation of energy, we can 
write it as 

k = 2 vl B T 2 t ( tiXjWEjO) ~ x/M/M) (21 -322) 

from which we get the final result 


A = J 0 <5( * )5(0)> dt (21 ” 323) 

where 

d * « 

E(l)=~r Z x j E i (21-324) 

at j= i 

One usually sees this equation written in terms of a vectorial flux S(f): 


1 


3Vk B T 2 J 0 


!°°<S(0 • S(0)> dt 


where 


d N 

dt j= i 




(21-325) 


(21-326) 


where </?,) is the enthalpy per molecule. 

We purposely passed over the expression for the bulk viscosity coefficient since its 
derivation is fairly subtle (e.g., see Helfand’s article). In fact, there was some controversy 
concerning its correct form for a time. This controversy no longer exists, and the 
accepted expression is* 


i4rC^??? <jaa(0)jWw> (21-327) 


* R. Zwanzig, Ann. Rev. Phys. Chem 16, p. 67, 1965. 
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where a and b run over x , y, and z and 


N 


jot = £ PjaPjt + £ p _ <r * > 
j= 1 m j= 1 


(21-328) 


where r Ja is the ath component of r 7 - and is a function of pressure p, internal 
energy E, and particle number N: 


<J at > = ^ab V 




<N>) + ^I (£ “ <£>) ] 


(21-329) 


This which does not occur in the case of the shear viscosity coefficient, takes 

account of the ensemble used for the averages. If a microcanonical ensemble is used, 
both the second and third terms vanish. For a canonical ensemble, only the second term 
vanishes, whereas for a grand canonical ensemble, none of them vanish. 

In this section we have derived time-correlation function expressions for the thermal 
transport coefficients. These equations do not possess the rigor or generality of those 
in the previous sections of this chapter, but they are valid for normal hydrodynamic 
behavior, i.e., for slow processes and for smoothly varying spatial inhomogeneities. 
In the next chapter, we shall study the limitations of these equations more carefully. 

Clearly, these equations are quite formal since an exact determination of the appro- 
priate time-correlation function is a difficult problem, but the simplicity of the final 
formulas and the physically appealing nature of a time-correlation function allow one 
to make simple, intuitive approximations, which can afford good agreement with 
experiment. In addition, at least we now have equations of the correct formal 
structure, which can be used to derive general theorems such as a law of corresponding 
states for transport in dense fluids. We shall discuss both such applications in the next 
section. 


21-9 APPLICATIONS OF THE TIME-CORRELATION FUNCTION 

FORMULAS FOR THE THERMAL TRANSPORT COEFFICIENTS 

As we have stated before, the actual calculation of a time-correlation function is a 
very difficult task in general. In the dilute gas limit, however, where the Boltzmann 
equation serves as the exact kinetic theory equation, the various time-correlation 
functions can be evaluated exactly. This has been done by McLennan and Swenson,* 
who derive at an intermediate stage an integral equation which is identical with that 
appearing in the first Chapman-Enskog solution of the Boltzmann equation. Thus we 
see the not so surprising result that in order to evaluate a time-correlation function 
exactly in the low-density limit, we are essentially back to the Boltzmann equation. 
Instead of discussing this exact correspondence, we shall instead present some approxi- 
mate calculations in this section. One of the appealing aspects of the time-correlation 
function formalism is that it gives not only a formally exact representation of various 
transport coefficients, but that it also does this in terms of an intuitively simple function 
for which one can “ derive ” approximate forms. In this section we shall first discuss 
the dilute gas limit, then dense fluids, and finally the law of corresponding states for 
transport processes. 


J. A. McLennan, Jr., and R. J. Swenson,/. Math. Phys., 4, p. 1527, 1963. 
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In the dilute gas limit we can neglect the terms involving the intermolecular potential 
in Eqs. (21-278), (21-307), (21-325), and (21-327) to write 

D = —\ <P«(0)PJ0> dt (21-330) 

m 

n = y k T m 2 J o jLPb@)PlJ®PjAQPjJ(.t)^ dt (21-331 ) 

* ■ - mf ^ - <^“>)C?. *sr - < J “>) * (2, - 332 > 


2 - 5 dr‘ C ii I, (*ST - ikT ) (# - *‘ T ) * 


(21-333) 


Note that each of these equations is dependent only upon various functions of the 
momenta of the particles. Clearly the most accurate way to proceed is to solve the 
Boltzmann equation for the probability that a molecule has a momentum p Jjt) at time 
t given that it had some momentum p j initially, i.e., for the singlet momentum dis- 
tribution function. This is the approach mentioned above that leads to essentially the 
Chapman-Enskog development. 

Instead of carrying out this complicated Chapman-Enskog approach, we shall 
evaluate Eqs. (21-330) through (21-333) by a mean free path model. Before doing this, 
however, it is instructive to show that the bulk viscosity of a monatomic ideal gas is 
zero. As Eq. (21-329) indicates, one must specify the ensemble used to evaluate k, 
and we shall use the microcanonical ensemble for simplicity. In this ensemble, 
Eq. (21-332) becomes 


K = ^ 1 1 j”<2K fl (0) - pV)(2K b (x) - pV)> dt 


(21-334) 


where we have introduced the total kinetic energy in the aih direction, K a , and used 
the fact that < J aa y = pV in a microcanonical ensemble. If we now sum this over a and b 
and use the fact that the total kinetic energy K equals K x + K y + K z , then Eq. (21-334) 
becomes 


k = f <[2tf(0) -3pV][2K(t) - 3 pV]> dt (21-335) 

9 VkT •'q 

Now since we are neglecting the intermolecular potential in this dilute gas limit, the 
kinetic energy is equal to the total energy, and since we are averaging over a micro- 
canonical ensemble, this energy is fixed, so that AT(0) = K(t) = E. It is easy to show from 
Eqs. (1-37) and Eq. (3-24) that pV = 2Ej3, and so we see that both factors in the 
integrand of Eq. (21-335) vanish. It is a good exercise to prove this in the canonical 
and grand canonical ensembles as well. 

We shall now evaluate Eqs. (21-330), (21-331), and (21-333) in a mean free path 
approximation.* We assume that the motion of the molecules in the gas can be 


* W. A. Steele, in Transport Phenomena in Fluids , edited by H. J. M. Hanley (New York: Dekker, 1969), 
Chapter 8. 
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described by a succession of straight-line paths, interrupted intermittently by collisions 
with other molecules. We furthermore assume that the momentum vector after a 
collision has a completely random distribution relative to its previous direction. 
In other words, we are assuming that the correlation of the momentum, or any 
function of the momentum, is complete until a collision occurs and is zero after- 
ward. It is possible to account for the correlation between successive collisions by 
computing the so-called persistence of velocity effect, but we shall not do this here. 

If we let P{p/0), p/0); t) be the fraction of molecules with momenta p { and p 7 
which will not collide and hence still be correlated at time t , we can write 


Gif) = (£ t /[p f (0)]^[p/0)]P{p f (0), p/0); t} 


(21-336) 


where / and g are arbitrary functions and the angular brackets now denote an average 
over all initial momenta. 

The fraction of the molecules that have not collided by time t can be calculated in the 
following way. Let the collision frequency of the molecules with momenta p and p' 
be 1/t c (p, p'). Then we can write that 


dP_ _P 
dt r c 

which can be integrated to 

We can substitute this into Eq. (21-336) to get 

&=^2<Pix 2 *c(Pi,Pj)> 


(21-337) 


(21-338) 


(21-339) 


’ -vik? (I, »/>) 

A - wp (I, (£ - ikT ) (£; ' ikT ) ^ ^ ■ Pj) ) 


(21-340) 


(21-341) 


If we neglect correlations between different molecules, then all cross terms in these 
three equations vanish, and we have 


D =-^2 <Px\(p)> 

TYl 


(21-342) 


n 


P 

kTm 2 


<p x 2 p y \(p)> 


X 



(21-343) 

(21-344) 


If we now introduce the Maxwell-Boltzmann distribution function and the reduced 
momentum 


w 


2 


P 


2 


2mkT 
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Eqs. (21-342) through (21-344) become 


n ikT f“ 4 -*i , v . 

(21-345) 

1 “ 15 ^l B J o dw 

(21-346) 


(21-347) 


These equations have been derived by Monchick and Mason* by calculating the 
first-order deviations from the Maxwell-Boltzmann distribution due to the combined 
effects of collisions and gradients in the appropriate macroscopic variables. 

Certainly the simplest approximation that we can make for t c (w) is to assume that it 
is independent of w and equal to simply the mean free path divided by the average 
velocity so that 





(21-348) 


If we use this in Eqs. (21-345) through (21-347), we get (Problem 24-46) 


i 

/ kT \ 1/2 1 


4 n 1 ' 2 

\/n/ a 2 p 

(21-349) 

1 

(mkT) 112 

(21-350) 

An 1 ' 2 

a 2 

5 /kl 

Y /2 C. 


6 \nr. 

n) N 0 u 2 

(21-351) 


These results are to be compared with Eqs. (16-36) through (16-39), where it can be 
seen that the agreement with the simple mean free path results of that chapter is 
quite good. 

Gordonf has presented a more sophisticated kinetic-theory evaluation of time- 
correlation functions which we present now. This approach allows one to account for 
higher-order collisions in a clear way. To begin with, we assume that the intermolecular 
forces have a finite range b 0 , beyond which they vanish. At the end of the calculation 
we shall let b 0 -> oo. We shall evaluate the momentum correlation function as an 
example. For those molecules that have not yet collided, we have that <p(0) • p(0> = 
<p(0) • p(0)>. If we assume molecular chaos, that is, if we assume that the number of 
collisions that a molecule suffers in any time interval is a Poisson process, then the 
fraction of molecules that have suffered exactly n collisions in the interval (0, t) is 



(21-352) 


where t _1 is the mean collision frequency. Therefore the number of molecules that 
has not collided in that time interval is exp (— */t), and the contribution to the momen- 
tum time-correlation function is <p(0) • p(0)>exp(— t/z). 


* L. Monchick and E. A. Mason, Phys. Fluids , 10, p. 1377, 1967. 
t R. G. Gordon, J. Chem. Phys., 44, p. 228, 1966. 
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The fraction of molecules that has collided once and only once by time t is 
(*/x)exp(— */t), and so the contribution to <p(0) • p(*)> from these molecules is 

<p(0) • p(l)> 0/i)exp(-r/T), 

where p(l) is the value of p after the first collision. Clearly we can continue in this way 
and write 

<p(0) • p( 0> = exp(- ^ [<P(0) • p(0)> + *- <p(0) • p(l)> 

+ <K0) • p(2)> + • • •] (21 - 353 ) 

In the previous calculation we neglected all the terms beyond the first term in brackets, 
but here we shall account for them in the following manner. 

We first integrate Eq. (21-353) to write 

3m 2 D = f °°(p(0) • p(0> dt 
J o 

= t[<P 2 (0)> + <p(0) - p(l)> + <p(0) • p(2)> + • • ] ( 21 - 354 ) 

We multiply and divide by <p 2 — p(0) • p(l)> and rearrange to get (Problem 21-47) 


3m2 D = / 2 mx «P 2 > 2 + t<P 2 ><P(0) • p(2)> - <p(0) • p(l)> 2 ] 

<p -p(U) p(l)> 

+ [<p 2 > <P(0) • p(3)> - <p(0) • p(l)> <P(0) • p(2)>] + — } ( 21 - 355 ) 


The terms in brackets in this equation represent a correlation between the effects of 
successive collisions and should be rather small (Problem 21-48). We can neglect these 
and write 


3 m 2 D 


T<P 2 ) 2 

<P 2 - p(0) • p(l)> 


( 21 - 356 ) 


This result can be rearranged into a slightly more convenient form by using the fact 
that a Boltzmann distribution of p before a collision remains a Boltzmann distribution 
after a collision when averaged over all types of collisions, so that 

<P 2 > = <P 2 (0)> = <P 2 (1)> 

But 

<P 2 (1)> = <(P(0) + [p(l) - P(0)]} 2 > 

= < P 2 (o)> + 2<p(o) • ip(i) - mr> + <[p(i) - p(o)] 2 > 

and so 

0 = 2<p(0) • [p(l) - p(0)]> + <[p(l) - p(0)] 2 > 

Thus 


<P 2 - p(0) • p(l)> = <p(0) • [p(0) - p(l)]> 

= Ktp(i)-p(0)] 2 > = K(Ap) 2 > 


and so 


3 m 2 D 


2t<p 2 ) 2 

<(Ap) 2 > 


( 21 - 357 ) 
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The average over impact parameter b may be written explicitly as 

<(Ap) 2 > =-L f t0 <(Ap) 2 >2 nbdb 
nt>o J o 


(21-358) 


If we give t in Eq. (21-357) its kinetic theory value x = 1 lpv r nb 0 2 = 1/2 112 pvnb 0 2 , 
where v r is the mean relative velocity, then Eq. (21-357) becomes 


3 m 2 D = 


2<P 2 > 2 

pv r Jo° <(Ap) 2 >27tfe db 


or 


D = 


2<p 2 ) 2 

3 m 2 pv r Jo <(Ap) 2 >2?rh db 


(21-359) 


We now show that this equation is the Chapman-Enskog result for D. The average of 
(Ap) 2 in Eq. (21-359) is over all collisions and has already been evaluated in Sec- 
tion 16-3. It is shown there that (Ap) 2 = m 2 v 2 ( 1 — cos *)/2, where x is the deflection 
angle and that the fraction of collisions with relative velocity v r is 


1 

8 


(_m\ 2 v 3 p -mvS/4-kT 

\kT ) Vr 


dv r 


Thus we have that 


/ yyi \ 2 - oo 

<(Ap) 2 > = — (— ) J o p r s (l - cos x ) e - mv ' 2 /* kT dv r (21 -360) 

and 

J <(Ap) 2 >2 nb db = ^ (jfy 2 J ” v r 5 c(v r )e ~ mv ' 1 i AkT dVf 

r 00 

= 4 mkT \ g 5 o(g)e~ 02 dg (21-361) 

J o 

where g is a reduced relative velocity, g 2 = mv r z l4kT. The coefficient of self-diffusion 
becomes 


^ 2(3mkT) 2 / nm \ 1/2 1 

3pm 2 \16kT/ 4mkT ^g s aig)e~ eI dg (21-362) 

Using the standard definition of the collision integrals [Eqs. (19—19) through (19-21)], 
we see that 


< 2i - 363) 

which is Eq. (19-26), the Chapman-Enskog result. 

The calculation of the time-correlation function for a dense system is, of course, much 
more difficult than in the binary collision limit. The exact determination of these correla- 
tion functions is a complicated dynamical problem involving the motion of many par- 
ticles. Molecular dynamics calculations have been carried out for a number of systems 
and these have shed much light on the behavior of time-correlation functions. For 
example Fig. 21-11 shows the normalized velocity correlation function for both a 
system, of hard spheres and a system of Lennard-Jones molecules. This molecular 
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dynamics information can be used to suggest simple analytical expressions, which can 
be used to approximate the time-correlation function. These approximate expressions 
contain parameters which can be determined in several ways, most obviously by 
requiring that the first few terms in the Maclaurin expansion for the correlation 
function be exact. We shall illustrate this approach for the coefficient of self-diffusion. 
We start with 


D = — — f 4>(t) dt 
m ^ 


where 


<m • p(o> 


m <m m> 

We expand <f>(t) in a Maclaurin expansion, giving 
<Kt)=i(-iTA 2n t 2 ” 

n= 0 

where 

<p (w) (0) ■ p (f<) (0)> 

2 " {2n)\3mkT 

where the notation p (n) (0) denotes the nth time derivative of p evaluated at t = 0. 
Clearly A 0 = 1. To evaluate A 2 , it is convenient to first use the fact that 

<p(0)*p(0)> = "<p(0)*p(0)> 
and that in the Poisson bracket notation 


( 21 - 364 ) 


( 21 - 365 ) 


( 21 - 366 ) 


( 21 - 367 ) 


p 1 = -{H,p 1 } = -V 1 H=-V l U N 

where V x represents the gradient with respect to the spatial coordinates of particle 1. 
By repeating this and then averaging over all initial momenta and coordinates, we get 

<p(0)-p(0)>= -kT<y 2 u„y 

If we assume that U N is pair-wise additive, then 

<p 2 (0)> = N kT<y 2 u(r 1 2 )> 


= pkTj g(r)V 2 u 4nr 2 dr 


( 21 - 368 ) 


which is essentially A 2 - 

Perhaps the simplest approximation that we can make for $(*) is to assume that it is 
Gaussian,* in which case we have <t>(t) = exp (—^ 2 * 2 ) an d 


D 


kT /_rc\ 1/2 

m \a 2 J 


( 21 - 369 ) 


This form for <f)(t) does not give the negative region found from molecular dynamics 
calculations, but requires only one moment, A 2 . Below we shall introduce a more 
flexible form for <t>(t) that does give the negative region, but, in addition, requires A a 
as well as A 2 . The A 2n become progressively more difficult to calculate, and A± is 


* S. A. Rice,/. Chem. Phys 33, p. 1376, 1960. 
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probably the practical limit. There are not a great deal of data for the coefficient of 
self-diffusion in dense fluids, but it has been measured in liquid argon at 90°K and a 
density of 1.374 g/cm 3 . Using a Lennard- Jones 6-12 potential with potential param- 
eters ejk = 120°K and o = 3.4 A and the molecular dynamics radial distribution 
function of Verlet,* Eq. (21-368) can be integrated numerically to give A 2 = 3.94 x 10 25 
sec _2 ,t which according to Eq. (21-369) gives a value of D = 3.44 x 10“ 5 cm 2 /sec. 
This is to be compared to experimental values of 2.43 x 10“ 5 J cm 2 /sec and 2.06 
x 10” 5 § cm 2 /sec and Rahman’s molecular dynamics value of 2.43 x 10” 5 cm 2 /sec.** 

It is possible to carry this procedure one term further and to use A 4 . This would 
allow one to use an approximate form for <£(*) that contains two parameters. A par- 
ticularly convenient form is <£(*) = sech(at)cos(bt). The advantage of this form is that 
it is even in time, gives the negative region that is found in the molecular dynamics 
calculations, and decays as e~ at cos bt for long times. Douglasstt and Isbister and 
McQuarriet have carried out calculations using this form for <p(t) and find reasonably 
good agreement with available experimental and molecular dynamics values for D. 
The A 2n become progressively more difficult to calculate, and so this procedure is not 
readily extended. Nevertheless, it appears to be a useful method for estimating the 
transport coefficients, although calculations have been done only for D in liquid 
argon (Problem 21-59). 

We have often stated that the odd powers of t do not appear in the Maclaurin 
expansion of a classical time-correlation function. In a hard-sphere fluid, however, 
the discontinuous nature of the intermolecular potential gives impulsive forces so that 
the time derivatives of the velocity of a molecule are no longer well defined or well 
behaved. In this case, then, instead of an equation like Eq. (21-202), we write 

<v(0) • v(0> = <v 2 > + *(!) o <v(0) • *0> + J (^) o <v( 0 ) • v(f)> + • • • (21-370) 

where the time differentiation is done after the ensemble averaging. When this pro- 
cedure is carried out for hard spheres, it is found that the linear term in t in the above 
equation is nonzero, implying that the autocorrelation function has a discontinuous 
slope at the origin. For a hard-sphere system, then, we can assume that <v(0) • v(*)> 
decays exponentially: 

< v i(0) " v i(0> = ( v i 2 y e ~ at (21-371) 

and determine a from the linear term in Eq. (21-370): 


(d\ <^(0) • v^Q) 

Wo <”i 2 > 


= — lim 

Af->0 


= — lim 

Ar->0 



(21-372) 


♦ L. Verlet, Phys. Rev., 165, p. 201, 1968. 

t D. J. Isbister and D. A. McQuarrie, /. Chem. Phys., 56, p. 736, 1972. 
J J. Naghizadeh and S. A. Rice, /. Chem. Phys., 36, p. 2710, 1962. 

§ J. W. Corbett and J. H. Wang, /. Chem. Phys., 25, p. 422, 1956. 

** A. Rahman, Phys. Rev., 136, p. A405, 1964. 
ft D- C. Douglass, J. Chem. Phys., 35, p. 81, 1961. 
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We can evaluate this expression in much the same manner that we evaluated 
<(Ap) 2 > in Eq. (21-359). From Eqs. (16-50) and (16-64), we have 

Pi • Ap t = tn l p{y l • ¥/ - Vi • v r ) 

= p 2 v r 2 (cos x ~ 1) + Wi/i(v c • v/ - v c • v r ) (21-373) 


We are going to average this over collisions, in which case the second and third terms 
here will vanish. If we average this over the rate of collisions of some “tagged” 
molecule with the others, we get, using Eq. (16-68) with p 1 = 1 and p 2 = p. 



p m 2 na 2 
= 4 


(^) 3/2 jVe-^ /4 * T dv r = 8 no 2 p^ W /2 

(21-374) 


This gives 


87ro 2 p tkT\ 112 

3 \nm) 


(21-375) 


and a coefficient of self-diffusion 

n _kT 3 /kT\ 112 1 
ma Sn 1/2 \ m ) pa 2 


(21-376) 


which is the Chapman-Enskog result for hard spheres. Thus we see that for a hard- 
sphere system, an exponentially decaying time-correlation function gives the Chapman- 
Enskog result. More interestingly, if we multiply the collision rate in Eq. (21-374) by 
p(l), the hard-sphere radial distribution function at contact, a is increased by g(l) 
and D is decreased by g(l), giving the Enskog dense hard-sphere fluid result for D 
[cf. Eq. (19-45)]. So we see that the Enskog theory is in some sense equivalent to 
assuming an exponentially decaying time-correlation function.* 

This has been shown also for the viscosity of a hard-sphere fluid. Wainwrightt has 
shown that if onfe assumes that the terms in the autocorrelation function giving the 
viscosity [cf. Eqs. (21-307) and (21-308)] are in the form of a decaying exponential, 
then one gets 


rj c L 

-or - =-+c 2 + c 3 y 

n bop y 


where the c } are quantities similar to a in Eq. (21-372). The integrals here are more 
complicated than in the case of the coefficient of self-diffusion above since the auto- 
correlation function giving the viscosity is more complicated, but by using a detailed 
analysis of hard-sphere collisions, Wainwright is able to show that c x = 1, c 2 =f, 
and c 3 = 0.7649, giving 


n 

ri°b 0 P 


1 4 

= - + - + 0.7649 y 

y 5 


as compared to the Enskog result [cf. Eq. (19-41)] 


n 

fl°b 0 P 


= - + ^ + 0.76>> 

y 5 


* This was apparently first shown by H. C. Longuet-Higgins and J. A. Pople, /. Chem. Phys., 25, p. 884, 
1956. 

t T. E. Wainwright, /. Chem . Phys ., 40, p. 2932, 1964. 
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Alder, Gass, and Wainwright* have made a detailed analysis of molecular dynamics 
calculations and the Enskog theory. 

The last topic that we shall discuss in this rather gigantic chapter is the law of 
corresponding states for the thermal transport processes.*!* Consider Eq. (21-307) for 
the viscosity coefficient: 

n = f< mm> at (21 -s??) 

where 

3 * Jk (hJt + ZjFjx ) (21-378) 

We assume that the intermolecular potential is pair-wise additive and of the form 
u(r) = ef(rja) and introduce the reduced variables 

r kT ^ u 

/**=-, T* = — , u* = - 
g e e 

From this basic set, we can see that since time has units of [mass-(distance) 2 /energy] 1/2 , 
then t is reduced by (c/m<x 2 ) 1/2 , i.e., t* = tz xl2 \m U2 G. Similarly we see that v* = i;(m/c) 1/2 , 
etc. This all allows us to write J in Eq. (21-377) in the reduced form J* = J/e so that we 
get 

m l/2 £ l/2 

t] = = — t]* (21-379) 

G z 

where 

ri* = p*) = /J <J*(0)J*(0> dt* (21-380) 


It is easy to show that the other thermal transport coefficients are (Problem 21-51) 


Xke*' 2 


A ~tn ll2 a 2 

(21-381) 

Dm 112 


D* =-jjr~ 

E y,z G 

(21-382) 

* K ° 2 


K ~m 1 'h 1 ' 2 

(21-383) 


Tham and GubbinsJ have made an extensive test of these relations; the results are 
shown in Figs. 21-12 through 21-14. The data plotted in Figs. 21-12 and 21-13 cover 
densities ranging from dilute gas to dense liquid, and pressures from 0 to 2000 atm. 
Not enough data are available to test Eq. (21-383) for k*. 

♦ B. J. Alder, D. M. Gass, and T. E. Wainwright,/. Chem. Phys ., 53, p. 3813, 1970. 
t E. Helfand and S. A. Rice, /. Chem. Phys., 32, p. 1642, 1960. 

t M. J. Tham and K. E. Gubbins, I and EC Fundamentals, 8, p. 791, 1969 ; 9, p. 63, 1970. 
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Figure 21-12. Reduced viscosity of inert gases. (From M. J. Tham and K. E. Gubbins, bid. Eng. Chem ., 
Fundam., 8, 791, 1969.) 



Figure 21-13. Reduced thermal conductivity of inert gases. (From M. J. Tham and K. E. Gubbins, Ind. 
Eng. Chem. Fundam ., 8, 791, 1969.) 



Figure 21-14. Reduced self-diffusion coefficient for saturated liquid inert gases. (From M. J. Tham and 
K. E. Gubbins, Ind. Eng . Chem. Fundam ., 8, 791, 1969.) 
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PROBLEMS 

21-1. Derive Eq. (21-12) from Eq. (21-11). 

21-2. Prove that if 3^o\n} = E„\n>, then e~ i *’° m \ny — e~ iEnt,tl and that <« | e ijr ° m = 
<n\e lE " tlh 

21-3. Show that 

<e • M(0)e • M(t)> = i<M(0) • M (f)> 
for an isotropic fluid. 

21-4. Referring to Fig. 21-3, show that if we let a AB be the components of the polariz- 
ability tensor in a molecule-fixed coordinate system, then the components in an arbitrary 
system are given by Eq. (21-52). 

21-5. Prove that the direction cosines satisfy 
Cxj 2 + Cyj 2 + Czj 2 = 1 


and 


Cxi Cxj + Cyt Cyj + Czi Czj 0 i ^ j 
21-6. Prove Eq. (21-57). 

21-7. Show that if the three unprimed axes in Fig. 21-3 are principal axes of the molecule, 
then Eq. (21-52) reduces to Eq. (21-53). 

21-8. Derive Eqs. (21-62) to (21-64). 

21-9. Show that the square of the anisotropic part of the polarizability is 
jS 2 = i[(«i — a 2 ) 2 + (a 2 — a 3 ) 2 + (a 3 — <Xi) 2 ] 

21-10. Show that 
a = $(a„ + 2a ± ) 

and 

J3 2 = §(«n — «±) 2 

for a linear or symmetric top molecule. 

21-11. Derive Eqs. (21-70) through (21-72). 

21-12. In Section 21-3, it is necessary to evaluate averages such as 
where e t n is the fth component of a unit vector, £j l is the yth component of a unit vector per- 
pendicular to the first, and the angular brackets denote a spherical average. The main difficulty 
here is to write out the components of the unit vector e L . There are several ways to do this, 
but one is the following. 

We start with a unit vector along the z-axis and rotate the coordinate system about the 
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Figure 21-15. The rotations used to find a unit vector perpendicular to an arbitrary unit vector c. 

x-axis by 0 degrees (cf. Fig. 21-15). Show that the matrix that gives the unit vector in the new 
coordinate system is 

(10 o \ 

Re = 1 0 cos 0 sin0 

\0 — sin# cos 0/ 

Show that the components of the unit vector in this coordinate system are (0, sin 0 , cos 0). 

Now rotate this new coordinate system clockwise about the z'-axis by ( 7 t / 2 ) — <f> radians 
(cf. Fig. 21-15). Show that the matrix that expresses the vector (0, sin 0, cos 0) in this coor- 
dinate system is 

( sin <f> cos (f> 0\ 

—cos <f> sin^ 01 

0 0 l) 

Show that the unit vector in this new coordinate system is given by (sin 0 cos <f>, sin 6 sin <£, 
cos 0), which is the conventional result for a unit vector in a spherical coordinate system. 
Thus the matrix 

( sin <f> cos 0 cos <f> 

—cos <f> cos 0 sin <f> 

0 —sin 0 

acts upon a vector in the original coordinate system and expresses it in the new coordinate 
system. 

Now choose a unit vector in the original coordinate system that is perpendicular to 
(0, 0, 1) there. This is e- 1 - = (cos a, sin a, 0), where a is arbitrary. Show that the new vector 
RtRe e 1 is perpendicular to (sin 0 cos (f>, sin 0 sin <f>, cos 0). This gives the components of a 
vector perpendicular to a unit vector e. Verify that the results given in Eq. (21-91) are inde- 
pendent of the arbitrary angle a. 

21-13. Another way to derive the results given in Eq. (21-91) is to begin with a coordinate 
system in which there is a unit vector along the z-axis and a unit vector perpendicular to 
that, for example, with components (cos a, sin a, 0). We now rotate the coordinate system 


sin 0 cos (f> 
sin 0 sin </> 
cos 0 
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through the three Euler angles <f>, 6 , ip. An excellent discussion of the Euler angles is given in 
Classical Mechanics by H. Goldstein (Reading, Mass. : Addison-Wesley, 1950). It is shown 
there that the matrix corresponding to a rotation through the three Eulerian angles 0 < 6 < ir, 
0 < <p < 2rr y and 0 < tp < 2tt is 

( cos tp cos <f> — cos 6 sin <f> sin tp —sin tp cos <p — cos 6 sin <f> cos tp sin 0 sin (p \ 
cos ip sin <p 4- cos 0 cos <p sin tp — sin ^ sin ^ + cos 6 cos <p cos ip — sin 0 cos ^1 
sin 6 sin tp sin 6 cos ip cos 6 ] 

Now start with a unit vector along the z- axis, (0, 0, 1), and one perpendicular to this 
(cos a, sin a, 0); rotate the coordinate system through the Euler angles; find Ae and 
and then verify Eq. (21-91) by averaging over the three Euler angles, remembering that the 
“ volume element ” is sin 6 d6 d<p dtp. 

21-14. Derive Eqs. (21-92) and (21-93). 

21-15. Derive Eqs. (21-99) and (21-100). 

21-16. Prove that 

ih( J)mn = (E n - E m )( M)m„ + 0(E) 

where J = M; i.e., derive Eq. (21-136). 

21-17. Prove that 

[J(f)U - + 0(E) 

i.e., derive Eq. (21-137). 

21-18. Prove that the trace of a product of matrices is invariant to acyclic permutation of 
the order of the matrices. 

21-19. Prove that both 


M0 = ^TT{ P [M ay Mm 

and 

M0 = ^Tr{[p,MJ4(f)} 

go to the correct classical limit. 

21-20. Prove that the Kubo transform of an operator A becomes simply the classical 
expression for A in the classical limit. 

21-21. Derive Eq. (21-237). 

21-22. Let 


<P(t) = ^Tr{p[A,B(t)]} 

be designated by <Pba(0- Then show that 4>baU) is real and that <p B A(—t) = <Pab(0. 
21-23. Prove that 



w daj' 

OJ 2 — OJ 


• 2 


21-24. Show that the Debye equations, Eqs. (21-177) and (21-178), satisfy the Kramers- 
Kronig relations. 

21-25. Prove that 


= *(x) 
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21-26. Equation (21-144) follows from Eq. (21-143) in the following way. Write out the 
trace operation appearing in Eq. (21-143) to get 

Tr{[p, Ma]Jb(t)} = 222 (pu MjkJki — MijpjkJkt) 

t J k 

Now show that 


8 tJ e- pE J 

Pu ~ Q 

Substitute this into the above triple summation to get 

2 ZipjMjkJkj- MjkPkJkj) 

J k 

Now use the fact that 

pk = pje - p <*k-*P 

and Eq. (21-136) to derive Eq. (21-144). 

21-27. Prove that Eq. (21-127) is the first-order solution to the time-dependent Schrodin- 
ger equation 


21-28. Prove that the matrix elements of the Kubo transform of an operator A are given 
by 


M ' = { E,-Ej f A,J 

21-29. Following through the derivation given in Section 21-4, prove that if 


3/e .(/) = —AF(t) 


and the response is given by B, then 


<B(t)y = f‘ dt'frt - t')F(t') + 0(F 2 ) 

Jo 

where 

<Kt) = ^Tr{[p, A]B(t)} 

= ^Tr{p[A,B(t)]} 

= pTr{ P AB(t)} 

where 

A = l (d\e**Ae-** 

P J O 

21-30. In Dirac notation the completeness relation is 
2|m> <m\ = 1 

m 

It is not really necessary to use Dirac notation in Section 21-1. Prove that the completeness 
relation can also be written as 

2 <p„*(x)<pn{x') = 8(x - x') 
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where the ip m form a complete, orthonormal set. Hint: This can be proved formally by expand- 
ing &(x — x') in terms of the set {ipmO c)}. 

21-31. Prove that the Liouville operator is Hermitian. 

21-32. Prove that the operator exp(±/Yj£?) is a unitary operator. 

21-33. Show that the solution to 
a A/ 

= -i<e o A/- F(t)(A, f 0 ) 
is 

A/(0 = - f e H, '- >se °(A,f 0 )F(t')dt' 

Jo 

where A/= 0 at t — 0. 

21-34. In this problem we shall discuss the density matrix and the quantum-mechanical 
Liouville equation. We expand the time-dependent state function in terms of some complete 
orthonormal set {^.(x)} according to 

'F( J c,r)=2c.(0&(*) 

n 

The density matrix p is defined by 
pnm “ C m *C n 

where the bar represents an ensemble average. The elements of the density matrix clearly 
depend upon the set {$ n (x)} y i.e., the representation used, but the final result of an ensemble 
average must be independent of the representation used. 

First prove that the trace of p is unity. Now prove that the ensemble average of some 
observable B is given by 


OF, m) = 2 Cm*Cn - 2 B mn Pnr, 

m.n m,n 

m 

= Tr{^p} 

where & is the operator corresponding to the observable B. Since a trace is independent of 
the representation or basis, we see that the ensemble average is also. 

Lastly, use the fact that 

'FO, <) = 2 C m (t)>pm(x) 

m 

satisfies the time-dependent Schrodinger equation 

to show that 


« - 7 = = - 2 Hn*) 

Ot k 



which is the quantum-mechanical Liouville equation, Eq. (21-242). 
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21-35. Prove that 
<a|p(/)>= <«(0|p> 

where a is a quantum-mechanical operator and p is the density matrix. 
21-36. Show that 


dt 


k 2 DF s ( k, 0 


where F,(k, t) is the Fourier transform of G(r, t), where 


dG 

— =DV*G 
dt 


21-37. Prove that 

£ dt' £ dr nr -n=it £ (i - jj/w dr 

by letting r = t" — t', changing the order of integration, and carrying out one obvious inte- 
gration. 

21-38. Derive the linearized Navier-Stokes equation [Eq. (21-283)] from the Navier- 
Stokes equation. 

21-39. Prove that 

V • (sym V • u) = JV 2 u + JV(V • u) 

and 

V * (V * u)l = V(V • u) 

21-40. Derive Eq. (21-286). 

21-41. Prove that 

JJJ dr e lU r Vp = ikpik, t ) 

||J dr c' k r V 2 u = — A: 2 J(k, t ) 

JJJ dr e* ■ r V(V • u) = - k[k • J(k, /)] 

21-42. Derive Eq. (21-305) from Eq. (21-304). 

21-43. Derive Eq. (21-307). 

21-44. Derive Eq. (21-321). 

21-45. Prove that the bulk viscosity of a monatomic ideal gas is zero. 

21-46. Derive Eqs. (21-349) through (21-351). 

21-47. Derive Eq. (21-355) from Eq. (21-354). 

21-48. Prove that the terms in brackets in Eq. (21-355) represent a correlation between 
the effects of successive collisions. 

21-49. Show that the impulsive nature of the hard-sphere potential leads to the presence 
of odd powers of t in the Maclaurin expansion of <v(0) • v(f)>. 

21-50. Derive Eq. (21-374). 
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21-51. Using the time-correlation function expressions, show that 


A* = 


A ke 1 ' 2 
m 1 , 2 o 2 


D* = 


Dm 112 

fi 1/2 C7 


1C* 


KU 2 

m ll 2 e 112 


21-52. Prove that 


f </x/x(0 + /*(')/*>* " dt = (1 + e 
J — ao 

and that 

47 T 

£"(w) = — tanh ififiw J C + {t)e~ l<ot dt 

where — 

C+(t) = ) + /x(t)^> 

= <4[m, M01 + > 

where [a, £]+ = ab + ba is the anticommutator of a and b . 

21-53. In Problem 21-52 we introduced the correlation function 
C + (t)=<4M(0), A(t)] + > 

Prove that this more symmetric time-correlation function is a real, even function of time. 

21-54. Prove that an autocorrelation function such as <v(0) • v(f)> is a stationary function; 
i.e., <v(t) • y(t + t» is independent of t. 

21-55. Use the Schwartz inequality 
J f*S dx < (J f*fdx) l,2 (| g*g dx} 
to prove that 

-</! 2 ><</l(OM(0><<>l 2 > 

21-56. Since the after-effect function is a real function, the susceptibility x(*>) must be 
complex. Prove that 

xV) = x'(-w) 

x"( w ) = — x*(— “) 
x*( a, ) = x(- £ °) 

21-57. Prove that if the external field is harmonic, i.e., if 
F(t)=Fe ,a ‘ 

then 

<m> = xHe ,at 

Since the applied field must be real, we represent a monochromatic force by 
F(t) = HFe , '* + F*e-‘ at ] 


| dt 

J m 
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Show that 

= ilxWFe 1 *' + x(-o >)F*e- 1 ”'] 

Show that this is equivalent to 

<fl(0> = @ex(aj)Fe t(0t 

If F(t) adds a term —AF(t) to the Hamiltonian, this is indicated in the above equation by 
writing explicitly 

<B(t)> = @e X BA(<o)Fe t 

21-58. Consider a system that interacts with an external field according to 
H x {t)=-AF{t) 

and that this field induces a response B(t). Since the external field also induces the system to 
undergo transitions from one state to another, there is an absorption and emission of energy. 
The difference between the energy absorbed and emitted is the energy dissipation . 

The energy dissipation per unit time per unit volume Q(<o) can be related to the susceptibili- 
ty of the system. The rate of change of the energy of the system is dHJdt, and Q(oj) is obtained 
by averaging the expectation value of this over one period of the monochromatic field. 
Show that 

co r 2nfta dF 

Using the final equation of the previous problem, i.e., 

<A(t)> = RexMFe™ 
show that 


Xaa"(w)\F\ 2 

This important equation shows that the energy dissipation is directly related in the imaginary 
part of the susceptibility. 

But from Eq. (21-159), one can also show that (do this) 

Xaa"(w) = ^fjr{p[A, A(t)]}e-‘°“ dt 

Show that this is equivalent to 

X*a"( w ) = ~r tan h f C + (t)e~ iwt dt 

where C+(t) is the symmetric autocorrelation function 

C + (0 = <*[i4(0), A(t ) ] + > 

where [ ]+ denotes the anticommutator. 

If we call the Fourier transform of C+(0, G aa (oj), we have that 

47T 

X aa"(<o) = -jr tdLnh(iPhio)G AA (aj) 

Since the dissipation is linearly related to G AA ( a>), this is often called the power spectrum of 
the process A(t). In addition, note that the energy dissipated by a system exposed to some 
external disturbance can be directly related to the autocorrelation function, which describes 
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how spontaneous fluctuations regress to the equilibrium state. The relation between Q(w) 
and G aa (oj), namely, 

2,7TCjO 

Q(w) = — tanh(i)5//co) | F\ 2 G AA (w) 

is called the fluctuation-dissipation theorem. (See Berne, 1971, “Additional Reading.”) 

21-59, A simple function that qualitatively approximates the velocity autocorrelation 
function was apparently first suggested by Douglas.* 

<f>(t) = sech(at)cos(bt) 

One can evaluate a and b by comparing the Maclaurin expansion for this to the exact expan- 
sion 


#0 - 2 

n = o 

Show that according to this approximation 

irkT /nb\ 

D = - — sechl — ) 

2 ma \2a ) 

kT l ” \ J w [( 5-01 ,,2 \ 

“ m \[2(c — l)/4 2 ] l,2 / SeCh \2 [( c - 1 )J ) 


where c = 6AJA 2 2 - Isbister and McQuarriet have compared the predictions of this formula 
to experimental data. 


21-60. Prove thatf 

x t 2 (t)Piy(t)pjA0) 


(s- 


Ms 


*« i (*)M0p./»(0 


> 


= {2x t \0) Pl Mp J m) 


= (lx.mptxo)} 

(l x,*(0)p ty (0)pj t (t)^ = (z x t 2 (0 )pi„ 2 (0)^ 

^2 xj(t)x t (t)p jy (t)p, y (t)^ = ^2 jfj(0)x ( (0)p Jy (0)p l( .(0)^ 

= (lx t \0)p ty \0)^ 

21-61. Prove that <f>(t) in Eq. (21-187) is an odd function of t . 

21-62. Verify Eqs. (21-215). 

21-63. Brenner and McQuarrie (Can. J. Phys., 49, p. 837, 1971) have presented an analysis 
of collision-induced absorption in rare gas mixtures by considering the Maclaurin expansion 


♦ /. Chem. Phys., 35, p. 81, 1960. 

t J. Chem. Phys., 56, p. 736, 1972. 

t Cf. E. Helfand, Phys. Rev., 119, p. 1, 1960. 
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of the dipole moment autocorrelation function <f>(t) = <|x(0) • n(r)>. First show that if I(w) is 
the line shape function, then 

OtY 


iiD r 

4A) = 2 — r <o"/(tu) do 

it — O //! •l—oo 


- S ^<-.r 

ft = o // ! 

“ (ity 

= 2 — ry» 

n =o nl 


<(40) • 1 40> 


1 = 0 


Show that 
72 


Ll 

_/ 

" \ dr / 


Yin* 1=0 

and in particular that 


yo = 4w 



Mr) 1 2 r*dr 


4 irkT r® 


V2 = 


w 


f® 2 2||i| 2 ) 

/.*■*“ JF +-7 T-r* 


where m is the reduced mass and u(r) is the intermolecular potential. 

Brenner and McQuarrie have evaluated y 0 , y 2 , and y 4 for a Lennard-Jones potential and a 
dipole moment function of the form 


rfr) = Poe- rK + p 

where /x 0 , £, and c are constants. They then use these three moments in a so-called Gram 
Charlier series to derive an approximate line shape I(oj). 

A Gram-Charlier series is a series of the form 


Ax) = <pto + ^ <P <3 >(jc) + g ?> <4> w + • • • 

where f(x) is some distribution function, <p(x) is a standardized Gaussian distribution function 
(i.e., one with zero mean and unit variance), and <p w (x) is its yth derivative. Show that 
<o)> = 0 and <a> 2 > = y 2 /y 0 2 so that x = co/< o> 2 > 1/2 for collision-induced absorption in rare 
gas mixtures. Show also that 


C| = (— iyf Hj(x)Ax) dx 
. 1—00 

where Hj(x) is a Hermite polynomial. 

We see that a Gram-Charlier series replaces the function f(x) by a Gaussian distribution 
and correction terms involving the Cj. Since the line shape shown in Fig. 28-9 is symmetric 
about oj = 0, the first correction, c 3 is zero. Show that the first nonzero correction to the 
simple Gaussian approximation involving c 4 is given by 


c 4 = 


nyo 

72 2 


-3 


and that the Gram-Charlier approximation to /( w) is 


/(«) 


^° T r- — ea2t2/4- l 

27T 1 ' 2 


C 4 

l+ 4i m 



where r = (2y 0 /y 2 ) 1/2 - Brenner and McQuarrie show that this gives excellent agreement with 
experiment. 



CHAPTER 22 


THE TIME-CORRELATION 
FUNCTION FORMALISM, II 


\ 


In this final chapter we shall continue our discussion of the time-correlation function 
formalism. In Section 22-1 we shall discuss the use of inelastic neutron scattering to 
probe the structure and dynamics of molecules in condensed media. The theoretical 
formalism was first presented by van Hove* in 1954. Of particular importance in 
this section is the introduction of the van Hove space-time correlation function 
G( r, t). In the second section we present some of the basic concepts and definitions of 
random functions and stochastic processes, and in particular, we derive and discuss 
the Wiener-Khintchine theorem, which relates a correlation function to its associated 
spectrum. The Wiener-Khintchine theorem is used in the next section to derive the 
basic equations for the spectrum of light scattered from a solution or a fluid. With 
the advent of the laser, this has become a powerful experimental tool. We shall see 
in that section that one can determine the diffusion coefficient and the electrophoretic 
mobility of macromolecules in solution, the hydrodynamic properties of liquids, and 
chemical kinetic rate constants from the spectrum of the scattered radiation. Then in 
Section 22-4 we shall introduce the so-called memory function associated with an 
autocorrelation function. It turns out that the memory function has both conceptual 
and computational advantages over the autocorrelation function itself, and we shall 
illustrate this with a calculation of the velocity autocorrelation function of a mon- 
atomic fluid. Lastly, in Section 22-5 we shall derive formulas for the thermal transport 
coefficients from within the memory function framework. This will more clearly 
elucidate the small-frequency and wave vector limit that was tacitly assumed in 
Section 21-8, where we derived such expressions from the appropriate macroscopic 
transport equations. We shall also be led to the concept of generalized thermal trans- 
port coefficients, which depend upon frequency co and wave vector k, and to general- 
ized hydrodynamics, the extension of hydrodynamics to arbitrary time scale and spatial 
inhomogeneities, i.e., to arbitrary co and k. 


L. van Hove, Phys. Rev., 95, p. 249, 1954. 
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t 

22-1 INELASTIC NEUTRON SCATTERING 


We shall see in this section that an analysis of the scattering of slow or thermal 
neutrons from fluids has proved to be a powerful technique in obtaining information 
concerning their dynamical behaviour and microscopic structure. The theoretical 
development of this technique is due originally to van Hove,* whose derivation is 
presented here. 

van Hove starts with the first Bom approximation (Marshall and Lovesey, 
“Additional Reading”), which says that the differential scattering cross section per 
unit solid angle and unit interval of outgoing energy e of the scattered particle (the 
neutron) is given by 


d 2 o 
dQ de 


m 


2n 2 h 6 


k * r 2 ml 

y W(K) £ p, 1 1 < i | rj \f>\ 2 S |k 2 ~ k o 2 + j2 ( Ef - Ed J 


( 22 - 1 ) 


where m, k 0 , and k = k 0 — k are the mass and the initial and final wave vectors of 
the scattered particle and k is defined by k = k 0 — k. The operators Tj represent the 
position vectors of the N particles of the scattering system (the fluid), whose initial 
and final states are labeled by i and /, and have energies E { and E f . The quantity p { 
is the statistical probability of the initial state and 


W(k) = 


| exp(iK • r)F(r) dr 


( 22 - 2 ) 


where F(r) is the interaction between the neutron and the nuclei of the fluid. We have 
also assumed (for now) that all the nuclei of the fluid interact in the same way with 
the incident neutrons. 

If, besides the momentum transfer fiK y we introduce the energy transfer through 


ho 


h\k 0 2 -k 2 ) 

2m 


(22-3) 


then Eq. (22-1) can be written in the form 
d 2 a 


dQ de 


= j4S(k, o) 


where 


, _ Nm 2 k JJU N 
A 4n 2 fi s k n W(K) 


and 


S(K, to) = N- 1 Y Pi X | <»-| /> l 2 ^(" + 


(22-4) 


(22-5) 


( 22 - 6 ) 


By introducing the Fourier transform representation of the delta function and 
proceeding much as we did in Section 21-1, this expression can be rewritten as 
(Problem 22-1) 

S(K,to) = -\- f“ dte-** V <g-« r i(°V« ^(0) (22-7) 

2nN J -a Q 


2n 



dt e~ i<at F{ k, 0 


( 22 - 8 ) 


L. van Hove, Phys. Re v. t 95, p. 249, 1954. 



INELASTIC NEUTRON SCATTERING 546 


where 


1 


F(k, 0 = -I ^ e ~ iK ri{0) e iK rj{t) y 
N UJ 


(22-9) 


The quantity S( k, o) is called the scattering function. 

We shall now express .S(k, co) in terms of a time-dependent extension of the radial 
distribution function G(r, f), which is called the van Hove space-time correlation 
function. This is defined through 


S(k, co) = (In)' 1 J e iiK ' r ““°G(r, t) dr dt 
G(r, t) = (2n)~ 3 J e~ iiK to) die dco 


( 22 - 10 ) 

( 22 - 11 ) 


Introducing Eq. (22-7) for S( k, o>) into Eq. (22-11) and integrating over f, we get 
(Problem 22-2) 

G(r, t) = (2n)~ 3 N ~ 1 £ f die e- iK \e- iK ^°V K ^y (22-1 2) 

i,j J 

We shall see that the factor of (2n)~ 3 N~ x in this expression makes G(r, t) independent 
of N and asymptotically equal to the number density in the thermodynamic limit. 
The functions F(k, t) and G(r, t) are Fourier-related by 


G( r, 0 = (2ji) -3 J dm e~ tK r F( k, t) 
F(k, t)=j dr e iK r G(r, t) 


(22-13) 

(22-14) 


Since S(k 9 co) is an experimentally observable quantity and hence real, it is easy to 
see from Eq. (22-11) that G(r, t) has the symmetry (Problem 22-3) 


G(-r,-0 = G*(r, 0 


(22-15) 


In the classical limit; the operators in Eq. (22-12) commute, and so we may write 

G(r, t) = N~ 1 8[ r + r,(0) - r/01 ^ (22-1 6) 

which is a real quantity. Thus we see that complex values of G(r, t) reflect quantum 
properties of the system. For simplicity, we shall discuss only the classical limit, 
although the complete quantal formalism is well developed (cf. Marshall and Lovesey, 
in “Additional Reading”). 

Equation (22-16) shows that in the classical limit G(r, t) splits naturally into two 
parts: one that describes the correlation of one molecule at different times and one 
that describes the correlation between distinct particles at different times. Denote 
these correlation functions by G s (r, t) (s stands for same) and G d (r, t) (d stands for 
distinct). Thus we can write 


G(r, t) = G s (r, t) + G d (r 9 1) 


where 


G s ( r, t)=N~' (jts[T + r/0) - r/0]^ 


(22-17) 


(22-18) 
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For t = 0 these reduce to 


G s (t, 0) = «5(r) 


( 22 - 20 ) 


G„(r, 0) = p 0 g(r) 


(22-21 ) 


where g(j) is the radial distribution function of Chapter 16. 

Thus we see that neutron scattering allows one to experimentally probe both the 
spatial and temporal behavior of the molecules in a fluid. On the other hand, if the 
incident energy is sufficiently large compared to the energy transfers (as it is in the 
case of X-ray scattering), the momentum transfer for a given scattering angle is 
independent of the outgoing energy, and the differential cross section per unit solid 
angle becomes 


do f d 2 o 
dQ = JdQde 


de = fiA J S(k, o) do 


( 22 - 22 ) 


= fiAje Ur 6(t)G(T,t)didt 

= [1 + Po J «*“■' 9(r) dr] (22-23) 

in agreement with the X-ray scattering formulas in Chapter 13. 

In what we have discussed so far, we have assumed that all the nuclei in the fluid 
interact in exactly the same way with the incident neutrons. Generally, however, this 
is not the case. The scattering usually depends either on isotopic effects or nuclear 
internal variables, and so the scattering is not the same from all the nuclei. In par- 
ticular, we wish to extend the above results to the case of spin-dependent scattering. 
If dj is an operator that depends upon the spin of the 7th particle, we define a spin- 
dependent space-time correlation function by 


Hr, t) = N~ 



dr' a,( 0) <5[r + r,(0) - r']a/0 <5[r' - r/t)] 


> 


where 


aft) = exp 



(22-24) 


Note that if a } = 1 and we take the classical limit, in which the operators commute, 
then T(r, t) simply reduces to the G(r, t) given by Eq. (22-16) (Problem 22-4). 

For a system of Boltzmann particles with a spin-independent Hamiltonian and 
described by a canonical distribution function p ( = e~ fiEi /Q 9 there is no correlation 
between spins nor between spins and positions, and we can write (Problem 22-5) 

F(r, 0 = <c 2 > G s (r, t ) + <a> 2 G d (r, t ) (22-25) 


where a is any of the a } and <• • •> denotes an average over the spin states of the 
corresponding particle. The interaction between a slow neutron and the nuclei of the 
scattering system is often replaced by the so-called Fermi pseudopotential (cf. Marshall 
and Lovesey, in “Additional Reading,” Chapter 1) 


n,) - (n!r ) m 


(22-26) 
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where m is the mass of the neutron and a is called the scattering length of the nucleus 
(a may depend upon the nuclear spin state). Thus we may write (Problem 22-6) 


d2 ° ^ d2(T coh } rf2g inc 

dQ de dfl de dQ de 


(22-27) 


where 


d 2 °coh 

dQde 


d 2 a inc 

dQde 


jdy 2 N k 

2nh k 0 


— \e« Kr - a,) G(T,t)drdt 


«a 2 > - <a> 2 )N k 


2nh 


k 0 


JV ( ’ cr " c,r ) Gs (t 9 t)drdt 


(22-28) 

(22-29) 


Equations (22-28) and (22-29) are called coherent and incoherent scattering cross 
sections. 

Note that in the static approximation discussed around Eq. (22-23), we have that 
(Problem 22-7) 


^ = <a> 2 ^[ 1 + p 0 J> r 0 (r) *] 

= «* 2 > - <a> 2 )N 


(22-30) 

(22-31) 


Equation (22-25) assumes that there is no correlation in the positions of different 
scattering lengths. The distinction between coherent and incoherent scattering can be 
seen through the following simple example. First, assume that all the nuclei in the 
sample interact in exactly the same way with a neutron beam. There will be interference 
between the scattered waves from each nucleus, just as in X-ray scattering. This is 
what is referred to as coherent scattering. Suppose now that the sample consists of 
two types of scatterers and that one interacts with a neutron with an interaction +i, 
the other with energy —b, and that there are equal numbers of these species randomly 
distributed through the sample. Now since there is no correlation of the scattering 
amplitudes in space, on different nuclei, the spectrum does not contain any information 
on the collective modes, only on the motion of the individual scatterers, or the so- 
called single particle motions, or self-correlation functions. This is incoherent scat- 
tering. Coherent scattering contains both self-correlation and contributions from 
other particles being at the space-time point (r, t). 

Some samples are almost purely coherent scatterers — all the nuclei have the same 
scattering amplitude b. (The cross section is proportional to b 2 ). A good example of 
this is argon-36. Some, however, have nuclei spin effects or nuclear internal variables. 
Hydrogen is the prime example of this. A proton has vastly different scattering 
amplitudes b, depending on the orientation of the nuclear spin. Thus it has a large 
incoherent cross section (around 80 barns, 1 barn = 10" 24 cm 2 ), while the coherent 
cross section is less by a factor 40. Thus it is usually difficult to observe the collective 
modes but easy to see single-particle motion in H-containing samples. 

We see from these equations that an experimental determination of the incoherent 
scattering function S inc (K , coi) over a sufficiently broad region in k~co space would 
allow one to determine G s (r, t) by Fourier inversion. There are few experimental 
studies that are extensive enough to allow this to be done. We shall discuss a few such 



548 THE TIME-CORRELATION FUNCTION FORMALISM, II 


studies shortly, but before doing this we shall discuss some approximations to G s (r, t) 
and its Fourier transform F s (k, t). 

Consider a classical system, for which we can write 

F s (k, 0 = <y*- t'i(')-n(o )}y (22-32) 

Take the direction of k to be the x-axis and write 


F s ( K, o = *'> (22-33) 

Now using the method of cumulants (Problem 22-8), this can be written as 

F s (k, t ) = exp[— K 2 y 1 (r) + K 4 y 2 (/) + • • • ] (22-34) 

where 

yi(0 = f (t “ O <jc 1 (0)j^ 1 (/ / )> A 9 (22-35) 

J o 

= i<[ri(0-ri(0)] 2 > (22-36) 

and where expressions for y 2 (f), etc., have been given by Rahman, Singwi, and 
Sjolander* in terms of higher-order velocity correlation functions. They also show 
that for small times as well as long times, these higher terms can be neglected, giving 

F s (k, 0 = exp[— K 2 yj(0] (22-37) 

as the often-used Gaussian approximation for F s (k, t). The self-space-time correlation 
function corresponding to this is 


G s (r, 0 = [4^(0]“ 3/2 



(22-38) 


Equation (22-37) can be extended to give non-Gaussian corrections (Problem 22-9) 


F s (k, t) = + ia 2 («XK 2 y,(0] 2 + — } (22-39) 

where 

a M = jjrp^2-l (22-40) 


Note that a 2 (t) = 0 for a Gaussian form for G s (r, t). 

Equation (22-36) shows that y x (t) = Dt for a diffusing particle, and so we see from 
Eq. (22-38) that 


G s (r, 0 = (47r£>0 _3/2 exp 



(22-41) 


and that 

£inc(*. 


£>K 2 /jt 


(O 2 + (Dk 2 ) 2 


(22-42) 


We expect that this approximation to be most valid at small frequency since the form 
of G s (r, 0 is most valid at long times. Figure 22-1 shows the experimental S inc (x, 0) 


* Phys. Rev., 126, p. 986, 1962. 
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Figure 22-1. <S(x, 0) versus k 2 . The solid line shows the simple diffusion result. (From K. Skold et al., 
Phys. Rev ., 6A, p. 1107, 1972.) 

of liquid argon plotted versus k 2 along with Eq. (22-42) with D = 1.94 x 10" 5 cm 2 /sec. 
The agreement is seen to be very good. 

Although an exact calculation of y x (t) requires a knowledge of the velocity auto- 
correlation function, Eq. (22-36) shows that for short times 

kT 

ri(0=Kv-V>f 2 =— t 2 f-0 (22-43) 

This displays the initial free motion that the particle undergoes before colliding with 
another particle. Thus although the exact behavior of y x (f) is difficult to calculate for 
all times, we see that it goes as f 2 for short times and t for long times. 

Since 


F s (k, 0 = f e i<at S inc (K, m) dco (22-44) 

J — 00 

it can be determined by a Fourier inversion of the experimentally determined inco- 
herent cross section. Then using the form of F s (k, t) given by Eq. (22-39), one can 
determine y x (t) and a 2 (t) numerically from the data. This requires high-quality data, 
but Skold et al* have carried out a thorough study of liquid argon. They present a 
detailed discussion of the experimental procedure, and their paper is highly recom- 
mended. Figure 22-2, which comes from their paper, shows the numerically determined 
values of y t (t) and a 2 (f) as functions of time. It can be seen that y x (t) goes as t 2 for 
short times and Dt for long times as one expects. [The term ti 2 f$mkT that occurs along 
with the f 2 -term in the figure arises from the fact that the authors have Fourier-inverted 
a symmetric scattering function S(k 9 cd) = e~ h<a,2kT S(k, cd) instead of just S(k, cd) 
itself.] Figure 22-3, taken from Dasannacharya and Rao,f shows a plot of G s (r, t) 
versus r for various values of t. 


♦ Phys. Rev., 6A, p. 1107, 1972. 
t Phys. Rev., 137, p. A417, 1965. 
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Figure 22-2. y x (t) and a 2 (0 (scale to the right) determined experimentally for liquid argon by Skold 
et al. The limiting behavior of y x (t) at small and large times as obtained for a free gas and 
for simple diffusion, respectively, is also shown. The solid line in the a 2 (t) plot shows 
the result obtained by Levesque and Verlet, and the circle shows the result obtained by 
Rahman for a 2 {t) at t = 2.5 x 10“ 12 sec for argon at 94.4°K. (From K. Skold et al., 
Phys., Rev., 6A, p. 1107, 1972.) 



Figure 22-3 . Self-correlation function of liquid argon at 84.5°K for different times (in units of 10“ 1 3 sec)., 
The ordinate on the right-hand axis of (a) applies to t - 2. (From B. A. Dasannacharya and 
K. R. Rao, Phys. Rev., 137, p. A417, (1965.) 


It was mentioned in Chapter 1 1 that neutron scattering can be used to probe the 
distribution of vibrational frequencies in a crystal. First let us calculate y x (t) for a 
one-dimensional harmonic oscillator. In this case (Problem 22-14) 

<*l(0)*l(0> = <*1 2 > COS O)o t 
kT 

= COS COn t 

m 


(22-45) 
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and 


, v kT{\ — cos co 0 f) 

= 

For a harmonic lattice with a distribution of normal modes g(<o) d<o 9 then 
kT r°° / 1 — cos (of 


dco 


, . kT r m f 1 - COS (Ot\ 


But if we define a quantity /(co) through 
/(<») = 3^, / dt e iat (y(P) • v(f)> 

and use the fact 

y&) dt' (t - ?) <v(0) • v(/')> 


then 


1 f* 3fcT r® 

Jl(,) “ 3 J, ‘'“‘““ft®) 

m J 0 \ (O / 


(22-46) 


(22-47) 


(22-48) 


(22-49) 


(22-50) 


On comparing Eqs. (22-50) and (22-47) we see that /(co) [as defined by Eq. (22-48)], 
is the analog of the distribution of vibrational frequencies in a harmonic lattice. 

It is possible to determine /(co) in terms of the incoherent scattering function 
S inc (x, co). From Eq. (22-34), we have that 


F s (k, t) = l— K z y t (t) + 0(k a ) 
from which we write 

= - k 2 ViW + OQc*) 


(22-51) 


(22-52) 


and that 


MO) = - lim 


i a 2 F. 


K -.0 k 2 dt 2 
But Eq. (22-44) shows that 

F s (k, t ) = f“ e^‘S inc (K, to) da> 

J — CO 

and so 


(22-53) 


(22-54) 


1 d 2 F s 


\\ day e iou ay 2 S inc (K, ay) 

K d — qq 


(22-55) 


From Eq. (22-36) we see that 
MO = i<v(0) • v(0> 


(22-56) 
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and so 


By comparing this equation with Eq. (22-55), we see that 


(22-57) 


(22—58) 


Thus S inc (K , coi) contains information concerning the analog of the distribution of 
normal modes in liquid. Rahman* has calculated a normalized version of /(o>) by 
molecular dynamics, and the result is shown in Fig. 22-4. Also shown in this figure is 
/(co) according to the Langevin equation. In this case 


<v(0) • v(0> 


m 


(22-59) 


where £ = kTjmD and so 


f(v) = - I e & cos cot dt 
n Jq 

2 C 

n £ 2 + co 2 
Note also that 


(22-60) 


m 


2 mD 
nkT 


(22-61) 


which gives a direct relation between D and /( 0). 



hcj IkT 

Figure 22-4. Spectrum of the velocity autocorrelation function. The Lorentzian spectrum of a Langevin- 
type correlation is also shown. (From A. Rahman, Phys. Rev., 136, p. A405, 1964.) 


Phys. Rev., 136, p. A405, 1964. 
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Inelastic neutron scattering has proved to be a powerful tool to probe the motion 
of molecules in both fluids and solids. We have discussed only classical, monatomic 
fluids in this section, but a great deal of both theoretical and experimental work has 
been done on quantum fluids and polyatomic fluids. The works by Marshall and 
Lovesey and by Larsson, Dahlborg, and Skold (“Additional Reading”) discuss these 
extensions in some detail. 

We shall see in Section 22-3 that the spectrum of scattered light also can be used 
to probe molecular motion (although for different magnitudes of k and coi) and that 
the theoretical formalism is quite similar to that of neutron scattering. Before discus- 
sing light scattering, however, we must digress to discuss a well-known theorem of 
stochastic processes known as the Wiener-Khintchine theorem since this theorem 
plays a central role in the derivation of the basic light-scattering equations. 

22-2 THE WIENER-KHINTCHINE THEOREM 

Consider some random process described by the function x(t). This might be the 
trajectory of a particle undergoing Brownian motion, the output of some “ noisy ” 
electronic device, the voltage across a biological membrane, etc. If we repeat this 
process N times, we obtain a set of functions x(f, a), a = 1, . . . , N. We can think of 
these as N strip recordings as shown in Fig. 22-5. 

The mathematical abstraction of this is to consider a family of functions {;t(f, a)} 
where a may be continuous as well as discrete. 

Now if a is taken to be a random variable with a probability density p(a)dbc, then the 
family of functions {x(t, a)} is a stochastic process. A stochastic process can be thought 
of as a function of two variables t and a, where t is time and a is a random variable. 
Roughly speaking, the random variable X(t) does not depend upon time in a com- 
pletely definite way, but only in some probabilistic sense. 

More precisely, the random process X(t) is defined or described by a set of probabil- 
ity distributions. At any given time, find the fraction of the total number of functions 






Figure 22-5. Strip recordings of random functions. 
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that have a value between x and x + dx. This fraction, w t (x 9 t) dx, is called the first 
probability distribution. Now define the second probability distribution, w 2 (x u q ; 
x 2 , h) d x 2 » as the joint probability of finding X between jq and jq + dx x at time q 
and between x 2 and x 2 + dx 2 at time t 2 . This process can be continued on through 
the third, fourth, and all subsequent probability distributions. This set of functions 
completely characterizes the random process in a statistical sense. If we know the 
functions for all j\ we know all that can be known about the random process. The 
functions Wj must satisfy certain obvious conditions: 

(1) w H > o 

(2) w n (x t ,t 1 ;x 23 t 2 ;--;x n9 t n ) is a symmetric function in the set of variables 
(x j9 tj). This follows because w n is a joint probability. 

(3) w k (x u t L ; ; x k , t k ) = J J dx k ^ ■ dx„ w„(xi, t 1 ;--- ;x n , t„). 

In general, the complete determination of the above set of probability distributions 
is not feasible. For example, the determination of just vq(jt, t) would require that we 
determine the number of observations that X lie within x and x + dx for all values of t. 
Thus we would have to observe the time evolution of a large number of similarly 
prepared systems. Fortunately, however, there are several reasonable assumptions 
that can be made that greatly simplify matters. The first and least restrictive assump- 
tion that we shall discuss is to assume that the random process is stationary in time. 
By stationary we mean that the form of the probability distribution functions does 
not depend on a shift of the origin of time. In a sense we assume that the underlying 
probabilistic mechanism of the process does not change with time. More precisely, 
we say that a random process is stationary when the probability distributions of 
{jc ( f, a)} and {x(f + t, a)} are the same for any t. 

For a stationary random process then, we may, in principle at least, determine the 
various probability distributions from the experimental observation of x[t) for one 
system over a long period of time. This long-time record can be cut up into pieces 
of length T (where T is much longer than any “ periodicities ” occurring in the process), 
and these pieces may be treated as observations of different systems in an ensemble of 
similarly prepared systems. The underlying assumption here is the so-called ergodic 
hypothesis , which states that for a stationary random process, a large number of 
observations made on a single system at N arbitrary instants of time have the same 
statistical properties as observing N arbitrarily chosen systems at the same time from 
an ensemble of similar systems. 

Since a shift in the origin of time does not affect any of the statistical properties of 
a stationary random process, the probability density functions simplify to 


General 

Stationary 

Wi(x, t) dx 

WiC*) dx 

w 2 (x l .t 1 \X2yt 2 )dx l dx 2 

w 2 (x l9 x 2 \t 2 — ti) dxi dx 2 

>v 3 (*i, ti ; x 2 , t 2 ; x 3 , t 3 ) dx x dx 2 dx 3 

w 3 (x u x 2 , x 3 ; t 2 — t u t 3 — ti) dxi dx 2 dx 3 

etc. 



Lastly, we note one more property that follows from the ergodic hypothesis and 
the assumption of stationarity. In dealing with general random processes, there are 
two types of mean values that we encounter. One is obtained by observations made 
on many systems at some fixed time t (denote this average by <x», and the other is 
the time average made on one system as a function of time (denote this average by x). 
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For a stationary random process, both averages yield the same result. In terms of 
equations, we have 


1 r T 

x = lim — x(t) dt 

T -* oo 21 J -T 

r 00 

<x> = xw^x) dx 

* - m 


(22-62) 

(22-63) 


and for a stationary random process 

x = <x> (22-64) 

according to the ergodic hypothesis. 

The probability distributions of stationary random processes are indeed simpler 
than the general case, but it is necessary, nevertheless, to make further restrictions as 
well. This leads us to an important classification of random processes. A random 
process is said to be purely random when values of x at different times are completely 
uncorrelated. The probability distributions in this case become 


w 2 (x 1 , h,x 2 , t 2 ) = Wifo, , t 2 ) 

W'sfo, h;x 2 ,t 2 ;x 3 , t 3 ) = w, (x t , tjwfa , t 2 )w 1 fx 3 , t 3 ) 


(22-65) 


etc. The random process is completely specified by w v Purely random processes do 
not occur often in physical applications since in most real situations x(t x ) and x(t 2 ) 
will be correlated at least for small values of 1 1 2 — t t | . 

The next more complicated case, and one that turns out to be a reasonable abstrac- 
tion for a large number of physical processes, is to assume that the process is a 
Markov process . In order to define a Markov process, first consider that the time axis 
is divided into small intervals of length <5, and let tj = j8. We now introduce the 
conditional probability p„(x n , f„|x 0 , t 0 ; x u t 2 ; . . . ; x n _ u t n _ 2 ), t 0 <t 1 <- ••<#„, 
that X lie in the interval (x n , x n + dx n ) at time t n given that X(t 0 ) = x 0 , X{t x ) = x l9 
etc. We define a Markov process by the requirement that 


Pnfa n y t n \XQ y ^0* • • • * 1* l) ^n\^n— 1* ^n-l) (22 66) 

In other words, the probability that the system “is in the state n” at time t n depends 
only upon its state directly preceding t n and not upon the previous history of the 
process. Since all the p n reduce to p 2 , we see that a Markov process is completely 
characterized by the second distribution function. 

There are two functions associated with a continuous stationary random process 
that are central to the theory of stochastic processes. These two functions are the 
correlation function and the spectral density. We shall see the stationary random 
processes can be well characterized by either of these two functions. The correlation 
function of a continuous stationary random process is given by 


C( T) = lim 

T - co 21 



X(t + T )x(t) dt 
T 


(22-67) 


According to the ergodic hypothesis, C(t) also is equal to <x(f + x)x(t)>, i.e., an 
ensemble average of x(t + t )x(t) instead of a time average. We write this as 


vu 

C( t) = JJ x 2 x 2 w 2 (x u x 2 \x) dx t dx : 


( 22 - 68 ) 
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Note that if x r and x 2 are uncorrelated, then w 2 (x u x 2 ; z) = m , 1 (x 1 )m'i(x 2 ) and 
C(t) = 0 if <xj) = 0. 

On physical grounds, we expect that if x(t) is indeed a random process, x 1 and x 2 
will be more correlated for small z than for large z. In fact, we can prove that 

-C(0)<C(t)<C(0) ( 22 - 69 ) 

The proof goes as follows: Start with the obvious inequality 

[x(0 ± x(t + r)] 2 > 0 


or 

x 2 (t) + x 2 (t + t) ± 2x{t)x(t + t) > 0 

If we take either the ensemble or time average of both sides of this inequality, we get 
2C(0) > ±2 C(t), which is Eq. (22-69). 

One can also show that C(z) is an even function of z since 

C(t) = x(t)x(t + r) = x(t - z)x(t) 

= x(t)x{t — t) = C(— t) ( 22 - 70 ) 

The other central function in the theory of stochastic processes is the spectral 
density of x(t). Let 

x T (t) = x(t) for - T < t < T 

= 0 otherwise ( 22-71 ) 

Let the Fourier transform of x T (t) be A(to), i.e., 

A(w) = f x T (t)e~“* dt = f x(t)e~“°‘ dt ( 22 - 72 ) 

J -co J -T 

so that by Fourier inversion 

1 00 

*t( 0 f A(a>)e i(0t do ( 22 - 73 ) 

271 J -oo 

By Parseval’s theorem (Problem 22-16) we can write 


{ x T \t)dt = Y \A(fi))\ 2 do} 


2k - —oo 

This can be proved by writing 


( 22 - 74 ) 


f \A(o )\ 2 do>= $ A*(o>)A(d) do = f A*(<jj>) f x T (t)e ia * dt d<o (22-75) 

00 00 ^ — CO CO 

and interchanging orders of integration. By dividing Eq. (22-74) by 2Tand taking the 
limit, we get 


lim ^ f x t 2 (0 dt 

T-+ co J -T 



\A(o )\ 2 

IT 


do 


( 22 - 76 ) 

( 22 - 77 ) 
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Now we define the spectral density S(to) of x(t) by 


S(oi) = lim ^-\A(o)\ 2 

r->oo 

so that 

x 2 = f 5(o>) day 
2n 


(22-78) 


(22-79) 


By the ergodic hypothesis, S( ay) is the same for all x (t) in the ensemble. Note that since 
x(t) is real, A*(ay) = A(—eoi), and S( ay) is an even function of ay. 

This relation has an important and useful physical interpretation. If we consider 
x T {t) to be an electric current, then x 2 is the average power dissipated as the current 
passes through a unit resistor during the interval (—T y T). Thus S(ay) day/ln is the 
average power dissipated with frequencies between ay and ay + day; hence S(aol) is 
often called the power spectrum of the random process x(t). 

It turns out that the correlation function and the spectral density are connected by 
the so-called Wiener-Khintchine theorem , which states that 


and 


C(r) = 2- f °° S(co)e‘" r d(o = ^-C S(w) cos cot dto 

2n “ — oo 2 jt " — oo 

S(py) = f °° C( x)e~ i<ax dz = 2 f °°C(t) cos cot dz 


(22-80) 


(22-81 ) 


Thus according to the Wiener-Khintchine theorem, the correlation function and the 
spectral density are simply Fourier or cosine transforms of each other. This is a 
useful relationship since S(ay) is often experimentally measurable and this spectrum 
can be inverted to determine the correlation function. 

We shall prove the Wiener-Khintchine theorem in two ways, one using the notation 
of Fourier series and the other using the notation of Fourier transforms. 

Let x (0 be represented by a Fourier series over the interval (— T y T): 


*(0 = 



tint ® , . tint 

— + EM'" — 

* n= 1 i 


(22-82) 


There is no constant term a 0 here since 


0o 


= hl. T x{t)dt=x{t) 


(22-83) 


and we assume that the average value of x(t) is zero. Using the fact that [cos(nnt/T)] and 
[sin(«7if/7’)] are orthogonal in (—7’, T), we get that 



dt 


= Z («n 2 + K 2 ) (22-84) 

n= 1 

We can interpret a 2 + b 2 to be the power dissipated at the frequency o n = nnjlT. 
From Eq. (22-79), we can make the correspondence 


oo -oo 

2>„ 2 + 0-f 

n= 1 J 0 


S(py) day 


n 


as T -► oo 
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Similarly, we can evaluate the correlation function (Problem 22-18) 


1 r T 

C(t) = x(t)x(t + T) = — J x(t)x(t + x) dt 


nnz 


Z(a„ 2 + V)cos — 

n= 1 i 


(22-85) 


Now as T-> oo , we can write this as 
1 

L 

1 00 

= — f S{a})e im do 
2 71 j __ 


1 r°° 

C(t) = — S(co) cos cot dz 

71 Jn 


(22-86) 


(22-87) 


where we have used the fact that S(co) is an even function of co. By Fourier inversion 
we get 


S( co)= f C(T>T itot rfr 

^ — oo 

r 00 

= 2 C(t) COS COT G?T 

J n 


since C(t) is an even function of t. 

For the second proof of the Wiener-Khintchine theorem, we start with 


1 r T 

CiU) = J *r(0 x T (t + *) dt 

so that 

C(t) = lim C t (t) 

T-»oo 


The Fourier transform of C t (t) is 

f °° C T (r)e~ im dx = ^-f e~™ f°° x T (t)x T (t + x) dt dx 

•'—00 00 ^ — oo 

= 2 ^ dt dx 

= hf- XT ^ a 'f_ Xr(t + dt dx 


= -L X(c>M(-<o) = ^ 


Passing to the limit T-*co then and using Eq. (22-78), this becomes 
r C(r)e“ tor = S(to) 

** — m 


( 22 - 88 ) 

(22-89) 


(22-90) 


(22-91) 


(22-92) 


(22-93) 


or 


S(co) = 2 f °°C( t) 
J o 


COS COT Jt 


(22-94) 
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In many physical applications it is a good approximation that all the probability 
distributions are Gaussian. A process in which they are all normal is called a Gaussian 
or normal process. 

The discussion of Gaussian processes is most conveniently started with the Fourier 
series representation of a stationary random function x(t). Let T be a sufficiently 
long time and suppose that x(t) is periodic with a period 2 T. Then we can write 


x(t) = £ (aj cos Ojt + bj sin co y f) (22-95) 

j= i 


where coj = nj/T. Again, there is no constant term since we assume for convenience 
that the mean value of x(t) is zero. 

The coefficients a } and b } are random variables, and we assume that they are all 
mutually independent, have normal distributions with zero means, and that the 
variances of d s and bj are both equal to of. Thus we can write 


oo i r (a 2 4 - b 

p(a x , a 2 ..,b k , b 2 ,...)= J[I ex P [ 'la 2 J 

Note that 

<«,•> = <bj> = 0 
<.aja k y = <jbjb k y = Oj 2 6 Jk 
<ajb k >=0 

In addition, we have 


(22-96) 


(22-97) 


<x 2 (0> = Y « a /> cos 2 "j* + < b /> si ° 2 

j= 1 

oo l *00 

Y Oj 2 = a 2 « — S(o>) do) (22-98) 

j= 1 2jl ^ — 00 


where the next to the last step follows as T-> oo . 

In Problem 22-19 one proves that since x(t) is a linear combination of the dj and 
bj , x(t) itself is a normal process with distribution 


H'lW = 


1 

y/2na 2 



(22-99) 


Note that time does not appear in this distribution since X(t) is assumed to represent 
a stationary random process. 

For the second probability distribution, we consider 


x(h) = £ (aj cos a>j q + bj sin a>j t x ) 
j= i 

oo 

x(t 2 ) = Y (dj cos a>j t 2 4- bj sin aij t 2 ) 

j-i 

from which we obtain (Problem 22-20) 

00 

<x(i 1 )x(i 2 )> = Y «°/> cos <Vi cos Mj* 2 + <h/> sin Ojt y sin a> } t 2 ) 
j= 1 

oo l »oo 

Y of COS Oj T ~ — — S(co) COS COT d(D 

j=l 271 ^ — oo 

= C(r) = a 2 p(x) 


( 22 - 100 ) 


(22-101) 
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where t = t 2 — h as before and p(i) is a normalized correlation function defined by 
<jc(0)x(t)>/<jc 2 (0)>. Note that <jc(^)jc(^ 2 )) depends only upon t, as it should for a 
stationary process. 

According to Problem 22-21 the second probability distribution function is 
, 1 [" ( jq 2 + x 2 2 — 2px 1 x 2 )“( 

w 2 (.Xi, x 2 ;t)= 27K7 2 (1 _ p2 y/2 ex P [ 2a\l - p 2 ) J (22_1 ° 2) 

The third probability distribution function is a three-dimensional normal distribu- 
tion which depends on x u x 2 , and x 3 and t 2 — h and r 3 — t v In fact, all the distribu- 
tion functions are normal and depend only upon the x/s , a 2 , and p(Tj) where 

T y = tj - t u j>2. 

We see then that for a Gaussian random process, all the distribution functions can 
be determined in terms of the correlation function. In many important physical 
applications, the spectral density is readily determined, and then in turn the correla- 
tion function, and then the probability distribution functions. 

In the next section we shall need to discuss the correlation function 

C(0 = (x*(t)x(t)x*(0)x(0)> (22-1 03) 

for Gaussian random process. In order to discuss this, it is best to write x(t) in a 
Fourier representation of complex exponentials rather than sines and cosines; i.e., 
write 

*<0= Z V“"' (22-104) 

71 = — CO 

For a Gaussian process 

= a„ 2 S nm (22-105) 

and (Problem 22-24) 

( c i* c j c k* c i) — $ki + Of 2 ° j 2 3 ii Sjk (22-1 06) 

Using these results, it is easy to see that 

<x*(t)x(0)>= £ o 2 *-™*' (22-107) 

j-~ CO 

and that (Problem 22-25) 

<x*0)x(0x*(0)x(0)> = IE cjW + Z I OjWe*”'-"* 

j k j k 

= | <x*(0)x(0)> | 2 + | <x*(0x(0)> | 2 (22-1 08) 

This expression is used in the interpretation of certain light-scattering measurements 
to be discussed in the next section. 

There is an important theorem in the theory of Gaussian random processes that 
says that a one-dimensional Gaussian random process will be Markovian only when 
the correlation function is of the form e~ ax . This is known as Doob’s theorem (for 
proof of Doob’s theorem, see Wang and Uhlenbeck in “Additional Reading”). 
It also follows from Doob’s theorem that the spectrum must be of the form const./ 
(o 2 + a 2 ). This is often called a Markovian spectrum and is one of the most common 
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power spectra that occur in practice. Doob’s theorem can be generalized to w-dimen- 
sions. Instead of a single correlation function, one must deal with a correlation matrix 
whose elements are given by 

C fj <T) = <Xi(t)Xj(t + t)> 
and which can be written as 

C(t) = exp ( Qt) (22-1 09) 

where Q is some (not generally symmetric) matrix. In this case, the power spectrum 
need not be of the form const. /(co 2 + fi 2 ), but instead different forms of Q will yield 
different forms for the spectra. 

Now that we have discussed some of the more important concepts of random 
variables and stochastic processes, we can derive the basic formulas of light-scattering 
spectroscopy by appealing directly to the Wiener-Khintchine theorem. 


22-3 LASER LIGHT SCATTERING 

It is a standard technique to use light scattering to determine the shape and size of 
polymer molecules in solution. This is done through a measurement of the intensity 
and the angular dependence of the scattering. With the development of lasers, it has 
become possible to determine the spectrum of the scattered light, and we shall see in 
this section that the spectrum contains information related to the hydrodynamic 
properties of the scatterers, such as the translational and rotational diffusion coeffici- 
ents. The theoretical analysis of the scattered spectrum was first presented by Komarov 
and Fisher* * * § and Pecoraf independently in 1963. In 1964, the spectrum of laser light 
scattered by dilute solutions of polystyrene latex spheres was observed and was found 
to exhibit a lineshape in good quantitive agreement with the theory.! 

In this section we shall present a derivation due to Cummins et fl/.,§ which utilizes 
the Wiener-Khintchine theorem. Consider a volume v, which contains N identical 
scatterers in a nonscattering solvent. If we irradiate this scattering volume with a 
monochromatic plane wave of frequency o 0 , the light scattered through an angle 6 
by the jth scatterer is given in complex notation by (Problem 13-47) 

= Aj(t)e i2lt<t>J e~ ic30t (22-1 1 0) 

where Aj(t ), the amplitude of may depend upon the orientation of the scatterer. 
According to Eq. (13-97) 

2t i(}>j = ~t~ (u 0 u) • Ty = k • Tj (22-111) 

/ 0 

where n is the index of refraction and A 0 is the wavelength of the radiation in the 
solution (mostly solvent). The magnitude of k is given by Eq. (13-98) to be k = 
(47r/i/A 0 )sin(6/2). 

* L. I. Komarov and I. Z. Fisher, Soviet Phys. t JETP t 16, p. 1358, 1963. 

t R. Pecora, /. Chem. Phys. 3 40, p. 1604, 1964. 

t H. Z. Cummins, N. Knable, and Y. Yeh, Phys. Rev. Lett., 12, p. 150, 1964. 

§ H. Z. Cummins et al. 3 Biophys. J., 9, p. 518, 1969. 
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For the total scattered field observed at a large distance R from the scattering 
volume, we have 

N 

= £ Aj(ty K T 'e- ia>ot ( 22 - 112 ) 

J= i 

The total average scattered intensity is given by I s = <|<? s | 2 >, where the angular 
brackets denotes a time average as in Problem 13-47. 

It is perhaps appropriate at this point to present a brief digression on the light- 
scattering experiment itself. There are two experimental techniques that are used to 
determine the extremely small-frequency shifts that result. These involve the so-called 
optical mixing techniques,* in which the scattered spectrum is mixed with itself 
( homodyne method) or mixed with part of the incident beam {heterodyne method ). 
Since the interpretation of experimental results depends upon which method is used 
to determine the scattered spectrum, it is important to be aware of the difference. 

In a measurement of the homodyne type, the intensity of the scattered radiation is 
observed at the cathode of a photomultiplier. The output of the photomultiplier is 
usually analyzed by either a real-time autocorrelator or a spectrum analyzer. 

If a real-time autocorrelator is used, the time autocorrelation function of the output 
current of the photomultiplier 

m = mmy 

is measured. If a spectrum analyzer is used, the Fourier transform of C(t) 

cw-lf e ~^ c ^ dt 

is measured. Since the photomultiplier current i(t) is proportional to the intensity of 
the scattered radiation, we have that 

C(0 <X <<? S *(0^(0^*(0K(0)> 

°C<K(0| 2 W)| 2 > (22-113) 


It is customary at this point to assume that the scattered field is a Gaussian random 
process. According to Eq. (22-108), then, we have 


C(0oc|/(0)| 2 + |/(0| 2 

(22-114) 

where 


m = <K«>)r> 

(22-115) 

and 


m = <^o^o» 

(22-116) 


The term involving 7(0) represents a dc background that can be subtracted out, so 
that the I{t) term which contains the spectral information of interest, results. Thus in 
the homodyne method it is the quantity | I(t) | 2 = | <<^*(0^(0)) | 2 that is measured 
experimentally. 

In the heterodyne method, a portion of the incident radiation is mixed with the 


H. Z. Cummins and H. L. Swinney (“Additional Reading”); H. Z. Cummins et al., Biophys. J. 9, p. 518, 
1969. 
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scattered radiation before being led into the photomultiplier. The relevant correlation 
function in this case is 

C(t) oc < I <?o(0 + <#) 1 2 I <a0) + #J(0) 1 2 > (22-1 1 7) 

When this is written out, there are 16 terms, 10 of which vanish, 3 of which are time- 
independent terms which can be subtracted out as in the homodyne method, and the 
remaining 3 terms are proportional to (Problem 22-26) 

<|«?o| 2 > [e i “°'<«f s *(0)<f s (0> + *-^<<? s (0)<? s *(0>] + 0(0 (22-1 1 8) 

We have assumed here that |<? 0 1 > | S s \ . Thus we see that in the heterodyne method, 
the correlation function is essentially that of the scattered field. 

In order to compute the spectrum of the scattered light, we use the Wiener- 
Khintchine theorem : 




C( z)e l ™ dz 


( 22 - 119 ) 


where C(—t) = C*(t) since 7(co) is real. In the heterodyne method, the autocorrela- 
tion function C( r) is [(cf. Eq. 22-118)] 

C(t) = x <S*{t)S s (t + t)> + + t)> (22-1 20) 


where the angular brackets denote a time average or, by the ergodic hypothesis, an 
ensemble average as well. 

If we substitute Eq. (22-112) into + t)> we get 

<<COKs(' + *)> = (£ Aj*(t)e- iK ^e iwot £ Alt + T y^^^e~ iC)o(t+z) 

V= 1 «*= 1 

( 22 - 121 ) 

If we now assume that the scatterers are independent, that the position and orientation 
factors are independent, that the N scatterers are identical, and that the correlation 
function is stationary, we get 

(S 9 *(t)g s (t + t)> = Ne- ici °\A*(0)A(z)) <*-*■• *«»**• *«> (22-122) 

and (Problem 22-27) 


7(co) = - f e^C A (r)C R (z)dx 

71 J 


where 


C A ( T) = <A*(0)A(z)) 

C R ( z) = <e" iK * r(0 V* K * r(t) > 


( 22 - 123 ) 


( 22 - 124 ) 

( 22 - 125 ) 


The separation of the orientation and position is not rigorously correct and should 
be considered as a simplifying assumption. 

If the scatterers are spherical, then the scattering amplitude is independent of time 
and 


7(o>) 


NA 2 



e™C R {z) dz 


71 


( 22 - 126 ) 
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Note that C R ( r) is simply the function F s ( k, t) defined in Section 22-1. In terms of the 
self-part of the van Hove space-time correlation function, we have 

F s (k, t) = | e‘ K I G s (T, z) dr (22-1 27) 

G s (r, t) is the ensemble averaged conditional probability that a particle located 
initially at the origin will be in a volume dr at r at time t. If the scattering volume is a 
dilute solution of macromolecules dissolved in a (nonscattering) solvent, G s (r, t) is 
given by the diffusion equation 


dG s 

Sr 


DV 2 G s 


(22-128) 


with the initial condition G s (r, 0) = <5(r). Thus 



GJS, z) = (4nDz) 3/2 exp^- 

(22-129) 


C R (z) =exp(— k 2 Dz) 

(22-130) 

and 

C(z) = NA 2 e~ K2Dt 

(22-1 31 ) 

and 


(22-1 32) 


Thus the Rayleigh scattered light has a Lorentzian lineshape centered at o = 0 
with a half-width A co lj2 = Dk 2 . One can determine the value of D by plotting Ac o 1/2 
versus k 2 = 4 k 0 2 sin 2 0/2, and this is now a fairly standard experimental procedure. 

In 1971, Ware and Flygare* presented an interesting extension of the above experi- 
ment. They show that one can determine the electrophoretic mobility of a charged 
macromolecule by observing the spectral distribution of light scattered from solutions 
of charged macromolecules under the influence of an electric field. The electro- 
phoretic mobility is defined by v = uE , where v is the terminal velocity of the macroion 
in the presence of the field. Consider now the time dependence of G s (r, t) under the 
condition of a drift in the direction of the electric field. If we take the direction of the 
field to be the x-direction, then the diffusion-type equation for G s (r, t) becomes 


dG, 

dr 


= D V 2 G S + uE 


Mb 

8x 


(22-133) 


where u is the electrophoretic mobility of the charged macromolecule. This equation 
can be readily solved by using Fourier transforms to give (Problem 22-28) 


G s (r, t) = ( 4nDr ) 3/2 expj — 


[( x + uEr) 2 + y 2 + z 2 ]) 


4Z>t 


(22-134) 


and 


C(t)= N A 2 e “ iKxuEv e~ * 2Dx 
* Chem. Phys. Lett., 12, p. 81, 1971. 


(22-1 35) 
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If we choose the x-axis to be perpendicular to the incident radiation, we find that 
k x = k cos 0/2, and the power spectrum is 

/(to) = 2NA 2 \- — 1 (22-136) 

L(co — kuE cos £0) 2 + ( k 2 D) 2 \ 

Thus we see that the Lorentzian lineshape of the Rayleigh scattered light is shifted 
in frequency by Act) = kuE cos ^0 under the influence of the electric field. The half- 
width is still related to the diffusion constant, however, through A co 1/2 = k 2 D. 
Notice that Eqs. (22-130), (22-131), (22-134), and (22-135) do not contain ct) 0 . 
This is because in the heterodyne detection method, the scattered light is mixed with 
a strong component of the incident radiation in a nonlinear detector such as a photo- 
multiplier tube, and the measured spectrum contains only the low-frequency beats 
between co 0 and the scattered light, and hence the spectrum is centered at co = 0. One 
can use an autocorrelator to observe the correlation function directly, and such a 
measurement is shown in Fig. 22-6, which is from the paper by Ware and Flygare. 
It shows the autocorrelation function at zero field and several other fields in a solution 
of the protein, bovine serum albumin (BSA). The autocorrelation function in the 
absence of the electric field is an exponential decay with a decay constant, t = 1 /k 2 D. 
In the presence of the electric field, the autocorrelation function is an exponentially 
damped cosine whose decay constant is also 1/k 2 D and whose period is equal to 



Figure 22-6. The autocorrelation function at zero field and fields of 115, 135, and 154 V/cm, respectively 
in a 5 percent solution of BSA in 0.004 M NaCl titrated to pH 9.2 with w-butylamine. 
Ao = 5145 A, T- 10°C, and 0 - 0.079 rad = 4.5°. (From B. R. Ware and W. H. Flygare, 
Chem. Phys . Lett. y 12, p. 81, 1971.) 
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At = 2k/k x uE, which can be reexpressed in the form At = AJuEn sin 0 (Problem 
22-30). Thus a measurement of the autocorrelation function in the presence of an 
electric field provides a simultaneous determination of the diffusion coefficient and 
the electrophoretic mobility. By determining the period of the autocorrelation function 
as a function of the field E , one can determine the electrophoretic mobility by plotting 
1/AtE versus sin 0. Figure 22-7 shows such a plot obtained from the autocorrelation 
functions in Fig. 22-6. 

Note that if the homodyne method of detection had been used, the appropriate 
correlation function would be 

C(z) = | <<? s *(0K s (t)> | 2 (22-1 37) 

which, according to the above, is 

C(t) = NA 2 e~ 2K2Dz (22-1 38) 


Thus the homodyne method yields a frequency spectrum of the form 


NA 2 r 00 

/(G)) =— — f e >mz e~ 2KiDz di 

2.7C J - oo 

NA 2 2 Dk 2 
n a) 2 + (2 Dk 2 ) 2 


(22-1 39) 

(22-140) 


Notice that this is also a Lorentzian frequency distribution, but the half-width in 
this case is 2 Dk 2 , rather than Dk 2 as in the heterodyne method. Figure 22-8 shows the 
measured half-width versus sin 2 (0/2) [k 2 = (4nn 0 /X) sin 2 (0/2)] for polystyrene latex 
spheres from both the homodyne and heterodyne methods. This figure is taken from 



sin 0 

Figure 12-1. A plot of 1/A tE as a function of sin 6 where At is the period of the oscillation of the auto- 
correlation function (see also Fig. 22-6). The expected k dependence in the sinusoidal 
oscillation of the autocorrelation function is evident The slope of the line leads to an average 
value for the mobility of 14 x 10“ 5 cm 2 /sec • V which is corrected to 20°C to give 18 x 10 -5 
as shown. (From B. R. Ware and W. H. Flygare, Chem. Phys. Lett., 12, p. 81, 1971.) 
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sin 2 (0/2) 

Figure 22-8. Single Lorentzian half-widths at 25°C as a function of sin 2 0/2 obtained from homodyne and 
heterodyne spectra of polystyrene latex spheres, 0.126 ± 0.004 /ll diameter, 0.01 mg/mg; 
O : homodyne spectra; □ : heterodyne spectra. (From Cummins et al ., Biophys. 9, p. 
518, 1969.) 


the paper by Cummins et al* who give an excellent discussion of the experimental 
and theoretical details of the homodyne and heterodyne detection methods. 

Our results up to now have been for a system of identical scatterers. For a solution 
of s different kinds of spherical macromolecules, the spectrum is given by 


I((o) = f 2c A 2 Dj 
/(co) ^c jAj |^ 2 + (K 2 Dj)2 \ 


(22-141) 


where Cj is the number concentration of the yth scattering species. This is a super- 
position of Lorentzian curves, all centered at <o = 0, and, in general, it is impossible 
to extract the various diffusion constants Dj from the experimental data. 

If the s species have different electrophoretic mobilities, however, the s Lorentzians 
will be centered at different frequencies and 


/(CO) 2c J A J 2 [ (co _ KUjEc o S ±ey + ( ^,] 


(22-142) 


Thus if the frequency shifts are suitably different and larger than the widths k 2 D J9 
the separate Lorentzians can be resolved. Ware and Flygaref have studied solutions 
of bovine serum albumin (BSA) and BSA-dimers and BSA-BSA dimers-fibrinogen 
solutions. 

Equation (22-140) is valid only if the various species are totally noninteracting and. 


* Biophys. 9, p. 518, 1969. 
t J. Coll Interface Sci. 9 39, p. 670, 1972. 
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in particular, are not reacting chemically. Let us now consider the case in which the 
scatterers can interconvert by isomerization or charge transfer, according to 

k a 

A B (22-143) 

k b 

Equation (22-133) now becomes two coupled simultaneous equations 


SGa 

8t 


= D a V 2 G a + u a E 


Wa 

dx 


k a G A + k b G b 


dG B 

dt 


. dG R 

D b V G b + u B E -r — k b G b + k a G a 
ox 


(22-144) 


where a separate G s (r, t) has been defined for each species. These two coupled equations 
can be solved analytically by using Fourier transforms. The general result has been 
obtained by Haas and Ware* and is an extremely lengthy expression. The numerical 
results, however, have a straightforward physical interpretation. For very slow 
kinetics, the spectrum appears as two separate peaks as though there were no reaction. 
In the other extreme, where the reaction is very fast, the spectrum consists of a single 
peak centered at an intermediate frequency. For kinetics that occur on approximately 
the same time scale as either the shift kujE cos (6/2) or the width k 2 Dj of the non- 
reacting spectra, the peaks tend to broaden and move toward each other as the rate 
constants increase. This is shown in Fig. 22-9, which are computer-simulated spectra 
calculated from the lengthy equations of Haas and Ware for a hypothetical system with 
transport coefficients typical of globular proteins. Clearly the magnitude of the rate 
constants has a dramatic effect on the spectrum, suggesting that accurate determina- 
tion of kinetics may be possible by this method. 

The spectra associated with other types of kinetic schemes have also been calculated, 
and the reader is referred to the work by Berne and Pecora (“ Additional Reading ”). 
Although not many experiments of this sort have yet been carried out, it appears that 
laser light scattering will become a powerful method of probing reaction kinetics. 

So far we have discussed light scattering only from independent macromolecules 
or scatterers. One can also observe light scattering from liquids, and clearly in this 
case one cannot assume that the molecules doing the scattering are independent. An 
important approach to light scattering from such dense systems is to assume that the 
density fluctuations reponsible for the scattering can be described by linear hydrody- 
namics. Mountainf whose paper we follow here, was apparently the first to present 
calculations from a hydrodynamic model. 

If the scattering centers are not independent, we cannot ignore cross terms as we 
did in going from Eq. (22-121) to Eq. (22-122), and so Eq. (22-123) is modified to 
read (Problem 22-31) 

S(k, co) = — f e~ i03T F( K, t) dt (22-145) 


where we have used the notation S(k, co) in place of /(co) and 


F(k, t) = -£ 

N i,j 


(22-146) 


* Unpublished results, 
t Rev. Mod. Phys ., 38, p. 205, 1966. 
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Figure 22-9. Predicted electrophoretic light-scattering spectra for the kinetic system A ^±B. Simulated 
experimental conditions are D A — D B — 6 x 10" 7 cm 2 /sec, [i A - 1 x 10~* cm 2 /sec • V, 
fjbg — 2 x 10"* cm 2 /sec • V, E= 100 V/cm, 6 = 1°,A 0 = 5.145 x 10" 5 cm. (From D. D. 
Haas and B. R. Ware, private communication; see B. R. Ware, in “Additional Read- 
ing”) 

as in Eq. (22-9). Since the phase space operator that corresponds to the density is 

P(r, 0= E <5[r - r/0] (22-147) 

7=1 

we see that the operators in the ensemble average in Eq. (22-146) are the Fourier 
transforms of p(r, t) 9 namely, 

P(K, 0) = | dr e~ iK t p( r, 0) 

(22-148) 

P*(k, t) dr e lK r p(r, t) 

Thus Eq. (22-146) can be written in the form 

F(k, t )=Jf <p*( k > °)p( k - 0> 

= ^ <p-Mp*(t)> 


(22-149) 
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which shows that F( k, t) and hence the scattered spectrum S( k, w) can be determined 
from the correlation function of P(k, t). Mountain determines £(k, t) from the 
linearized hydrodynamics equations in the following manner. 

According to Eq. (21-286), the linearized Navier-Stokes equation is 

du , 

mp-=-Vp + ri s V 2 u + Qr) s + t} v ) V(V • u) (22-1 50) 


where we write rj s and tj v for the shear and bulk viscosities, respectively. The mass 
continuity equation 

dp 

+ dlV pVL =■ 0 (22—1 51 ) 

dt 


can be linearized by writing p = p 0 + p l9 where p 0 is the equilibrium density: 


dpi 

dt 


+ p 0 div u = 0 


(22-152) 


The linearized energy balance can be written as [cf. Eq. (17-30)] 


dE 

Pm dt 


—p div u + A V 2 7\ 


(22-153) 


We can use Eq. (22-152) to eliminate div uin Eqs. (22-150) and (22-153) and use 
straightforward thermodynamic manipulations to get (Problem 22-32), 

du c 0 2 , c 0 2 Bmp 0 , 

mpo — + — grad p x + grad T x - (fa + rj v ) grad div u = 0 (22-1 54) 

dt y y 

and 


dT x 

mPo^ 


cjy - 1) gpi 

p dt 


- A V 2 7\ = 0 


(22-1 55) 


where c v is the specific heat, y = c p !c v , p is the thermal expansion coefficient, c 0 is the 
zero-frequency speed of sound, and rj s and rj v are the shear and bulk viscosity, 
respectively. 

By taking the divergence of Eq. (22-154) to eliminate u through Eq. (22-152), we 
get two simultaneous partial differential equations in the two independent thermo- 
dynamic variables p x (r, t) and 7\(r, t). By taking the spatial Fourier transform and a 
Laplace transform with respect to time, these may be transformed into two simul- 
taneous algebraic equations for 


P(k, s) = J dr J dt e~ iK ' r e~ st pi(r, t) (22-156) 

t (k, s) = jdr J” dt e~ i ' c t e~ a Ti(r, t ) (22-1 57) 

We can then solve these equations for ^(k, t) in terms of p( k, 0), multiply by 

fi( — k, 0) and average over an equilibrium ensemble of initial conditions. The 

algebra involved is quite lengthy, but one eventually gets (see Mountain, in “ Additional 
Reading,” for details) 

<p- k (o)p*( 0> = < I p„(0) | 2 >'P(k, 0 


(22-158) 
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where 


¥( k , t) = [- — -\exp( — + - exp(— K 2 Tt)[cos c 0 Kt + 6(k) sin c 0 Kt] 

\ y / \ PocJ y 


(22-159) 


where T is the acoustic attenuation coefficient 


(22-1 60) 

(22-161) 

2A 2 S(k, 0), and so we can finally write 

S(k, <o) k 2 XIp 0 c„ 

S(k, 0) y to 2 + (k 2 X/p 0 c p ) 2 

1 f >c 2 r k 2 t 

+ y l( w + c 0 k) 2 + (fc 2 r) 2 + (to — c 0 k) 2 + ( K 2 r) 2 

b(} c) r C 0 K + 0) C 0 K — CO 

+ y [(g) + c 0 k) 2 + (k 2 F) 2 + (g> - c 0 k) 2 + (/c 2 r) 2 

In practice, this last term is difficult to observe. 

We see that the spectrum consists of a central (Rayleigh) component and two 
shifted (Brillouin) components. This is shown schematically in Fig. 22-10. The width 
of the Rayleigh peak is K 2 X[p 0 c p (thermal diffusion), and the width of the Brillouin 
peak is k 2 T. The ratio of the intensity of the Rayleigh component to the intensity of 
the Brillouin component is y — 1 (Problem 22-34). This is known as the Landau- 
Placzek ratio. Fleury and Boon* have observed the Brillouin scattering from liquid 


(22-162) 


wpo \PocJ 

and 

Equation (22-145) shows that <|p K (0)| 2 ) = 
(Problem 22-33) 



Figure 22-10. A schematic plot (not to scale) of the spectrum of light scattered from a fluid. (From R. D. 
Mountain, Chem. Rubber Co, Crit, Rev. Solid State , 1, p. 5, 1970.) 


Phys. Rev., 186, p. 244, 1969. 
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argon. In their experiment, k « 2.1 x 10 7 m -1 , and the observed Brillouin shift was 
» 1.8 x 10 10 sec' 1 , leading to a value of (1.8 x 10 lo )/(2.1 x 10 7 ) = 857 m/sec for the 
speed of sound in liquid argon under the conditions of the experiment, 85°K and 
592.5 mmHg. The low-frequency speed of sound measured acoustically is 853 m/sec. 
Mountain (“Additional Reading”) also gives an excellent discussion of light scat- 
tering from polyatomic fluids. The work by Fleury and Boon (“Additional Reading ”) 
provides a thorough and recent review of light scattering from fluids. 


22-4 THE MEMORY FUNCTION 

The direct evaluation of a time-correlation function is a prohibitively difficult task 
for an A-body system. Consequently it is not surprising that there have been many 
attempts to derive differential or integral equations whose solution would yield a 
correlation function. No simple or convenient equation (or set of equations) has yet 
been found, but one such attempt, due originally to Zwanzig (“Additional Reading ”), 
has led to a function, associated with a correlation function, which turns out to be 
very useful for both conceptual and computational purposes. This function, called the 
memory function , is the central topic of this section. We shall follow a derivation due 
to Berne, Boon, and Rice (“Additional Reading”) although the two works by 
Berne as well as Berne and Harp (“Additional Reading”) are also highly recom- 
mended. 

Let MO be a normalized autocorrelation function of the phase space function 
U(p,q); i.e., let 

MO = <U(0)U(t)> (22-163) 

with 

MO) = <U(0)U(0)> = 1 (22-1 64) 

Clearly U can be chosen in this way by defining U(t) = a(0/<<* 2 > 1/2 , where a(t) = 
<x(p, <T>t)is any suitable space function. For convenience, we also assume that 

<t/> =0 (22-165) 

If the angular brackets denote a canonical ensemble average, then we can write MO 
more explicitly as 


HO = ^jj d P d< l U (p, q) [exp (it&)U(p, g)^xp[~PH(p, q)] ( 22 - 1 66) 

where as usual (p, q) represents all the momenta and coordinates necessary to describe 
the A-body system, if is the Liouville operator, and Q is simply the canonical partition 
function. We have used Eq. (21-227) to write 17(0 as exp(z*if)t7(0). These brackets 
about exp (it&)U(p, q) are really unnecessary since H(p, q) is a constant of the motion. 
If we differentiate MO twice with respect to t, we get 

=2 -1 \dpdq U(p, q)[iSe exp (it&)i£?U(p, tf)]exp(- fiH) 


(22-167) 
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We can now integrate the right-hand side of this equation by parts or simply use the 
Hermitian property of to get (Problem 22-36) 

= ~Q~ l j dp dq V&ukxpiitswseukxpi-pH) 

= - <<Kp, q)zxp{it&ys>{p, q)y (22-1 68) 

where 

<b(p, q) = i&U = U 
since the time evolution of U satisfies 
dU 

— = il?U (22-169) 

Since exp(itJif)0(p, q) = U(p, q; t), Eq. (22-168) can be written as 

= -<t/(0)(/(0> s ~<Kt) (22-170) 

where the last equality defines <j)(t). Note that the initial conditions for this equation 

are 


*( 0 ) = 1 

^(0) = <(/(0)(7(0)>=0 (22 171) 

The Laplace transform of Eq. (22-170) gives 

s 2 #(s) - S = - <f)(s) (22-1 72) 

where s is the Laplace transform variable. If we substitute this into the identity, for 
s ^0, 


[s 2 {j/(s) - 5]| [s{j/(s) - 1] (22-1 73) 

we find 

-fayfo) = [l - Qfo)] [**(*) - 1] (22-174) 

For values of s such that 1 — (1 /s)<f>(s) ^ 0, this can be written as 
s^s) — i = — [i — (j)^] 

which is readily inverted by using the convolution theorem of Laplace transforms 
(Problem 22-37) to give 

^ = - f dx K(t)ip(t - t) (22-1 75) 

at Jq 

where 

k(s) = [1 - 


ls 2 $(s) - s]f(s) = (l + Q 


(22-176) 
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Notice that this is an integrodifferential equation for ij/(t ), Unfortunately the kernel 
K(r) is not a known function. Nevertheless, we can manipulate K( t) into an interesting 
form which will, in fact, turn out to be the memory function associated with ^(/). 
Since by Eq. (22-170) 

<Kt) = <U(0)U(t)> 

= <t/(0)exp(/^)(7(0)> 

its Laplace transform <f>(s) can be written in terms of the so-called resolvent operator 

(j-iSf)- 1 : 

$(S) =<U(S- i&)- l Uy ( 22 - 177 ) 

At this point we define a projection operator first introduced into statistical mech- 
anics by Zwanzig,* such that if G(p, #)is a suitably well-behaved phase space function, 
then 


&G(j>, q) = U(p, q)f cq J dp' dq' U(p\ q')G(p', q’) ( 22 - 1 78 ) 

where f c q is simply 

feq = Q 1 exp (~PH) 

The operator “ projects ” the phase space function G(p , q ) onto U{p , q). It is easy to 
prove that 0> 2 G = ^G, which is one of the requirements of a projection operator 
(Problem 22-38). 

By using the operator identity (Problem 22-39) 

( 22 - 179 ) 

with 


sf =s-i<£ 


and 


we can rewrite Eq. (22-177) for <f>(s) in the form 

<#>(*) = <#[* - uy + <g[5 - iser x i9se\s - z(i - &) 2 T X uy 

( 22 - 1 80 ) 

Now we can use the definition of & [Eq. (22-178)] to rewrite the second term in this 
equation. First write this second term as J dp dq U[s — i£^\~ 1 i0 > G y where 

An application of Eq. (22-178) gives then 
9G = Uf tq jdp dq U&ls - i( 1 - 
= vf^u&is - i(i - 

= iUf tq {U[s - i(l - (22-1 81 ) 


* J. Chem. Phys . , 33, p. 1338, 1960. 
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where to derive the last step we have used the Hermitian property of .£?; i.e., we have 
used the fact that [cf. Eq. (21-230) and Problem 22-40] 

<a*i? p} = <CS?a)*j8> = Kdpy (22-1 82) 

We might point out at this point that these manipulations are very similar to those 
involving the Dirac notation (Appendix G), and, in fact, the articles by Berne and 
Harp (1970) and Berne (1971) given in “Additional Reading” exploit this analogy 
to develop an elegant notation for the time-correlation function and memory function 
formalism. 

Using Eq. (22-181), then, in Eq. (22-180), we get 

to) = <u[s - i(i - &)&y l oy - <tf(i - ise)- l uy<jQ[s - /(i - &)&y l uy 

(22-183) 

Again if we use the operator identity given by Eq. (22-179) with si =s — i& and 
$S =s, we see that (Problem 22-22) 

(G(s - t&y'uy =-{uuy + - <#($ - i<ey x i<£uy 

s s 


1 


s 


( 22 - 1 84 ) 


But note that this is simply $( s)/s according to Eq. (22-177). Thus Eq. (22-183) reads 


1 ♦ 


<£(*) = - i(i - - - fas) <u[s - i( i - 0)&r x uy 

s 

and if this is substituted into Eq. (22-176) for f£(s\ we get that 
£(s) = <(7[j - f( i - &)&}- x uy 
which by inverse transformation becomes (Problem 22-41) 

K(t) = <t7 exp[/7(l - &)&]Uy 


( 22 - 1 85 ) 


( 22 - 1 86 ) 


This is the desired equation for the memory function K(t). Note that the operator in 
this equation contains the projection operator. 

We can summarize this algebraically long derivation quite simply by writing that if 
is a normalized correlation function of U and <{/> =0, then ^(f) satisfies the 
integrodifferential equation 


£ K{z)\l/{t - t) dx 


where the kernel of this equation is given by 
K( t) - <£/exp[tf(l - ^]Uy 


( 22 - 1 87 ) 


(22-1 88) 


and is called the memory function. 

We shall see shortly that the memory function can be very useful for computational 
purposes, but before doing so, we shall discuss some of its general properties. For 
example, using the fact that (1 — &)<£ is Hermitian (Problem 22-42), i.e., 


< <7(1 - gyrn = <10 - 0W9\*r> 
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for any two suitably well-behaved phase space functions / and g, it follows that 

(g exp[/7(l - = </exp[-i7(l - &)2’]g*') (22-1 89) 

and that a memory function is stationary. Furthermore, K(t) is an even, real function 
of time. To prove this, start with 

K(t) = <(7 exp[/7(l - &)<£\uy 

= Y.—A (i&U)[i(l - &)&Y(i&U)> (22-1 90) 

n=0 ^ • 

Consider the term arising from n = 1. This is 

<Sjseu)[i{\ - &)<e\iseuy> = < (tseu) [i&QSfuyy - <{i&u) w<£{i<euyb 

Clearly 

<itsev) [i&(i&u)] = (Wy = 0 

Also 

((i&U) = <(i&U)Uy 

= 0 

since <Xi£?U)uy = <£/£/> =0. By iteration, one can show that all terms arising 
from odd values of n vanish, and therefore K(t) is a real even function of time 
(Problem 22-43). One can also use the Schwartz inequality to prove that (cf. Problem 
21-55) 

- < uuy < K(t) < < uuy ( 22-1 91 > 

One of the most important properties of the memory function, however, is that it 
seems to lend itself very readily to simple approximations. For example, Eq. (22-187) 
shows that if we assume that K(x) decays extremely rapidly, e.g., #(t) » £(t), then the 
correlation function decays exponentially. Although this form of a correlation function 
is not rigorously correct, we have seen in these last two chapters that such an approxi- 
mate form leads to classic physical theories such as the Debye theory of dielectric 
relaxation and the Enskog hard-sphere transport theory. Since such a simple form for 
tf(r) leads to useful expressions for the correlation function ^(t), one might hope 
that more detailed assumptions for #(r) would lead to an even better ^(f). Such an 
approach has been used by a number of authors, but here we shall continue to follow 
Berne, Boon, and Rice,* who were one of the first to explore this idea. A more recent 
paper by Harp and Bernef gives a good summary of the various approaches. 

We shall consider the memory function associated with the normalized velocity 
autocorrelation function 


<K0 


<vi(0) • v,0)> 
<Vl 2 > 


(22-192) 


Since this correlation function involves a vector dot product, we must generalize some 
of the above concepts. For example, the projection operator becomes 

0>G(p, q) = U (p, ?)/ eq | dp dq U (p, q) • G(p, q) (22-1 93) 


* J. Chem. Phys., 45, p. 1086, 1966. 
t Phys. Rev., A2, p. 975, 1970. 
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It is straightforward to show that since 

iSto x 

m 

then the kernel function R(s) becomes (Problem 22-44) 

a-) [i ,- <*■ ».o» r <*•>.<■» (22 . 1M , 

1 ' [ J m I W'> ' 1 

where F t is the force on molecule 1 and ti(s) is the Laplace transform of F t (0* The 
value of£(0) is easily determined from the Einstein relation for the diffusion coefficient. 
For 


D = — I dt 
m J 0 

kT r 00 

= — lim f dte~ st W) 
m s -o J 0 

kT 

= — $(0) (22-195) 

m 


But according to Eqs. (22-174) and (22-176), 
s\f/(s) — 1 — fc(s)\j/(s) 
and so we see that 


D = [*(0)]- (22-196) 

Thus £(0) is directly related to the diffusion coefficient. 

Berne et al. argue on several grounds that a good approximation for K(t) is an 
exponentially decaying function 

y,t)=y exp(-a| 1 1) (22-1 97) 


where a and y are adjustable constants. One can readily show that this form for the 
memory function leads to a correlation function of the form (Problem 22-45) 


iKO 

where 


1 


•s'. 


(•s + e s " f — s.e 8 *') 


(22-1 98) 




1/2 


(22-1 99) 


This approximate ^(f) is tested against the molecular dynamics data of Rahman.* 
The two parameters a and y are determined from Rahman’s “ data.” From Eq. (22-196), 
we see that 


y kT 
a mD 


Phys. Rev., 136, p. A405, 1964. 



578 THE TIME-CORRELATION FUNCTION FORMALISM, II 


In addition, since (Problem 22-56) 


K( 0 ) 


CFi 2 > 

W 2 ^! 2 ) 


we find that 


_ <Fi 2 > <fs> <v t 2 ^> 

^ w 2 <t>i 2 > 3wA:!r 3w 

where t/ N is the total intermolecular potential and V t 2 denotes the Laplacian with 
respect to particle 1. Using Rahman’s data, Berne et al find that <y t 2 U N y = 11.0 
x 10 3 erg • cm -2 . Using Rahman’s value of D = 2.43 x 10“ 5 cm 2 /sec, it is found that 
a = 8.06 x 10 12 sec -1 and y = 6.5 x 10 25 sec “ 2 . With these values of a and y, the s ± 
are complex and 

m = exp(— [cos( 4 : °y^ ) + 3“ 1/2 sin ( 4 -° 3 / 3f )] (22-200) 

where t 0 = 10“ 12 sec. This autocorrelation function is plotted in Fig. 22-11 along 
with Rahman’s molecular dynamics value and a simple exponential approximation 
for ^(f). One sees that the qualitative features of the autocorrelation function are 
reproduced. Although the agreement is not perfect, it is important to note that the 
theoretical function correctly predicts a negative region for i//(/), despite the very 
simple form of the trial memory function. 

Berne et al. also calculate a normalized power spectrum, defined by 

kT r°° 

G(co) = — — I dt cos cot (22-201 ) 

tn D Jq 

This is essentially the function f(co) defined in Eq. (22-48), but here 
G(0) = 1 



Figure 22-11. The linear momentum autocorrelation function ip(t) from Rahman’s molecular dynamics 
calculations (1), from the Markovian approximation = exp(— j3|f|) (2), and Eq. 

(22-200) (3). (From B. J. Berne, J. P. Boon, and S. A. Rice, J. Chem . Phys., 45, p. 1086, 
1966.) 
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Figure 22-12. The normalized power spectrum G(io) from Rahman’s molecular dynamics calculations 
(1), from the Markovian approximation (2), and from Eq. (22-202) (3). (From B. J. 
Berne, J. P. Boon, and S. A. Rice, /. Chem. Phys. 9 45, p. 1086, 1966.) 

It is easy to show that Eq. (22-197) leads to (Problem 22-46) 

y 2 

y 2 a 2 Q _ 2y/a 2 )o 2 + co 4 
0.420 

" 0.420 - 6.5 x 10 - 27 a> 2 + 10" 52 co 4 (22-202) 

This theoretical power spectrum is plotted in Fig. 22-12 along with the Markovian 
approximation (which leads to a Lorentzian power spectrum) and the molecular 
dynamics data of Rahman. Again the agreement is seen to be fairly good. 

It appears then that the memory function offers a convenient computational 
approach to linear response theory calculations. 

22-5 DERIVATION OF THERMAL TRANSPORT COEFFICIENTS 

In Section 21-8 we derived time-correlation function expressions for the thermal 
transport coefficients by starting with the appropriate macroscopic transport equation. 
Since these derivations are based on the classical macroscopic transport equations, 
the resulting expressions are in some sense restricted to long-time, small wavelengths. 
In this section we shall rederive these results within a memory function formalism and 
investigate the long-time, small wavelength limit in some detail. We shall follow the 
1971 review article of Berne (“Additional Reading”). 

We start with the diffusion coefficient. In Section 21 -8 we showed that [Eq. (21-264)] 

F s ( k, 0 = exp(-A: 2 Z)0 (22-203) 

where this results by taking the Fourier transform of the macroscopic diffusion 
equation. We also saw in Section 21-8 [Eq. (21-271)] that 

F^k, 0 = <r <k ‘ f(0) ^ ,r(,) > 


(22-204) 
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This correlation function obeys the memory function equation 

= — ( dz K(k, t)F s ( k, / - t) (22-205) 

Ct Jq 

where 

A:(k, o = </ <eu exp [«(i - &)<e\i£eu'y ( 22 - 206 ) 

with 

U = exp(ik • r) (22-207) 

Since iSFU = (l = z'k • v exp[ik • r], it follows that 

K(k, t) = k • <v exp(— ik • r) exp[rt(l — exp(ik • r)) • k 

= k • D k (f) • k (22-208) 

where the tensor D t (r) is defined by 

D*(0 = <v exp(— ik • r)exp[/i(l — 0)£f\pi exp(/k • r)> (22-209) 

Equation (22-205) becomes 
dF r‘ 

-f = - f dz (k • D t (t) • kJF s (k, / - t) (22-21 0) 

at "o 

By taking the Laplace transform of this equation, we get 

Pj[k, s) = [s + k • D k (.s) • k] 1 (22-21 1 ) 

where the carets denote the Laplace transform of that function. This should be 
compared to the Laplace transform of F s (k, t) given by Eq. (22-203), namely, 

p£ k, i) = (s + k 2 D)~ l (22-21 2) 

We see that D t (s) is a complicated k- and ^-dependent diffusion coefficient. We wish 
to see under what conditions, i.e., in what limit, Eq. (22-21 1) goes over into Eq. (22-212). 
Using Eq. (22-203) as a guide, we redefine the time scale by introducing 

t =k 2 t (22-213) 

We now consider the double limit 

lim lim F,(k, t ) = lim lim e~ Dkl ‘ = e~ Dx (22-21 4) 

f-»oo k~*0 t-*oo k~*0 

r = const r=const 

This limit selects out from the exact F s (k, t) given in Eq. (22-211) the small k and 
long-time behavior; i.e., it selects out the hydrodynamic limit, where the wave vectors 
are small (smoothly spatially varying properties) and the time is long (compared to 
molecular times). To take this limit of the exact FjQl, t), we formally take the inverse 
of Eq. (22-211), 

F s (k, t) = | ds [s + k • D t (s) • k]" 1 (22-21 5) 

2niJ 

introduce the new time variable x=U l t and a corresponding new Laplace transform 
variable s = k 2 x to get 



k - D k (k 2 x) ■ k 
k 2 



( 22 - 21 6 ) 
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We now take the double limit of this equation 

lim lim F s ( k, /) = (fdxe xt [x + lim k * O k (k 2 x) • -^-l 
f-oo fc->o 2ni J L *->o k J 

x — const 

r /.. .. k • D t (s) • k\ 1 

= exp — (lim lim It 

L \s-»o»-»o kr / J 

By comparing this with Eq. (22-214) we see that 

k • D*Cs) • k 

D = lim lim 

S-»0 k-*0 k 


(22-217) 


( 22 - 21 8 ) 


where we have defined 


D k {s) = 


k * D k (j) • k 


This is not yet a useful expression for D since O k (s) is expressed in terms of a projection 
operator in Eq. (22-209). We can obtain a more useful expression for D by starting 
with Eq. (22-177), which in the present notation becomes 

$(k, s) = k • <v exp(— ik • r)[j — exp(/k - r)> • k (22-21 9) 

£(k, s) 

[1 + (l/s)£(k, s)] 
k 2 D k (s) 

= 7: 2 , . ~ . . , (22-220) 

[1 + (k 2 fs)D k (s)] 


( 22 - 220 ) 


( 22 - 221 ) 


so that 


— r $ (k, s) = 


to 


k 2 " v ’ ' [l+(A 2 A)tol 

Now according to Eqs. (22-218) and (22-221) 


D = lim lim D k (s) 

s-*0 Jc— * 0 


and so 


lim lim $(k, s) = D 


s~*0 k -¥ 0 


Using Eq. (22-219) for $(k, s), we see that 

D = lim lim k • <v exp(— ik • r)(s — iif) _1 v exp(ik • r)> • k 

S-»0 Jt-»0 


( 22 - 222 ) 


(22-223) 


In this limit, the exponentials in the above expression may be replaced by unity, so 
that the ensemble average becomes <v (s — i-£f) _1 v> which is the Laplace transform 
of <v(0)v(f)>- It is straightforward to show that for an isotropic fluid that (Problem 
22-51) 


< v (0)y(0>=Kv(0)-y(0>l 

where 1 is the unit tensor. Thus 

<v(j - i&Y 1 y> = * (s - i&Y 1 y>l 


(22-224) 
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We now substitute this into Eq. (22-223) and use the fact that k • I • k/k 2 = 1, we 
finally get 

D = lim^<v • (s - i'J 2 ?) -1 v> 

s ->0 

1 r 00 

= lim - dt e~ 5 '<v(0) • v(*)> (22-225) 

s-*o 3 'q 

which is essentially the result derived in Section 21-8 by starting with the diffusion 
equation. Equation (22-225), however, was derived as the limit of a general expression. 
If s is replaced by ico in the above equation, we see that the “ordinary” diffusion 
equation can be thought of as a low-frequency limit. In fact, we can define a frequency- 
dependent diffusion coefficient through Eqs. (22-223) through (22-225). Let 

D(co) = lim D k (co) 
o 

where ico ~ s. Equation (22-223) becomes 

D{co) = lim k - <v exp(— ik • r)(/co — iJSf)^ 1 v exp(/k • r)) • k 

fc -0 

and Eq. (22-225) becomes 

i 00 

D{w) ~ - f dt e _i “'<v(0) • v(<)> (22-226) 

3 J 0 

Zwanzig* has shown that D{co) and Z>(k, co) can be defined through a generalization 
of Fick’s first law of diffusion to arbitrary space-time variations of the driving force: 

j(k, co) = - D(k, Q))(Vc) t> „ (22-227 ) 

where j(k, co) is the Fourier component of the flux of the diffusing species, as a response 
to the (k, co)th Fourier component of the gradient of the concentration of the diffusing 
species. 

In Section 21-8 we also derived a time-correlation function expression for the shear 
viscosity by starting with the linearized Navier-Stokes equation. Equation (21-307) 
shows that the shear viscosity is given by 


dt <mm 

where 

We defined a correlation function C ± (k, t) [Eq. (21-292)] 


</ x *(k,0)/ x (M)) 
</l*( k, 0)/ x (k, 0)> 


exp(— k 2 vi) 


where v = r\fmp and 

/(k, o = i: xfty kz ^ 

j = i 


(22-228) 


(22-229) 


(22-230) 


(22-231) 


Phys. Rev., 133, p. A50, 1964. 
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Since C ± (k, t) is a normalized autocorrelation function of the phase space function 
/ x (k, t ) 


u </x<ky x (k» i/2 

then it satisfies a memory function equation 

ac x 


dt 


= -j dr K ± (k, t)C ± (k, t - t) 


where 

KJk, t ) = (JSPU exp[rt(l -0>)&\iSeuy 
where U is given by Eq. (22-232). The analog of Eq. (22-21 1) is 
C' 1 (k,s) = [s + R 1 (k,s)]- 


(22-232) 


(22-233) 


(22-234) 


(22-235) 


One proceeds in much the same way as we did above for the diffusion coefficient, and 
one can show that (Problem 22-52) 


v = lim lim £ ± (k, s ) 
s-»0 k->0 k 


(22-236) 


and that (cf. Berne (1971) “Additional Reading”], 

*1 = lim -Jr=, f dt p _s, <y(0)/(0> (22-237) 

s -o Vkl J 0 

Again we can define a frequency-dependent shear viscosity by 

¥«>) = ^ dt (22-238) 

Zwanzig and Mountain* have presented an interesting application of this formula. 
When a mechanical force is applied suddenly to a fluid, the fluid responds elastically 
at first, just as if it were a solid body. The initial response may be described by two 
quantities, the high-frequency limit G ^ of the shear modulus (or modulus of rigidity) 
and the high-frequency limit of the bulk modulus (or modulus of compression). 
Zwanzig and Mountain argue quite generally that 


G(co) = icoY\{co) 


and 


(22-239) 


K(o) = K( 0) + icoK(co) 


(22-240) 


where k(co) is the frequency-dependent bulk viscosity. We shall consider only the 
shear modulus here. We wish to find the high-frequency limit of rj(co). This can be 
done by generating a power series in 1/co in the following way. Let cot = z in Eq. 
(22-238) to get 




(22-241 ) 


J. Chem. Phys 43, p. 4464, 1965. 
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Assuming the time-correlation function has an expansion in powers of t with a non- 
zero radius of convergence, we may write (Problem 22-53) 

<22 - 242 > 

where 

m ~<mm> 

1i =4<w(°)> 

and so on. Thus we see that the high-frequency shear modulus G m is given by 

=^ <7( ° )y(0)> (22-243) 

which is just an equilibrium average of J in Eq. (22-229). After some manipulation, 
one finds that 


C„-pkT + f } p‘ J" 4r „<r> ± (r‘ £) (22-244, 

for a pair-wise additive intermolecular potential (Problem 22-54). Zwanzig and 
Mountain give an excellent discussion of high-frequency elastic moduli of simple 
fluids. 

We have seen in this section that it is possible to extend the formulas for the thermal 
transport coefficients to include frequency dependence and wave vector dependence. 
The extension of the hydrodynamic equations themselves is presently a very active 
and interesting area of the statistical mechanical theory of transport and is called 
generalized hydrodynamics. Several of the references in the “Additional Reading” 
should serve as an introduction to this field. 
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Berne, B. J. 1971. In Physical chemistry , an advanced treatise , Vol. 8B, ed. by D. Henderson. New 
York: Academic. 

•, and Forster, D. Ann. Rev. Phys. Chem ., 22, p. 563, 1971. 

, and Harp, G. D. Adv . Chem. Phys. , 17, p. 63, 1970. 

Harp, G., and Berne, B. J. Phys. Rev., 2A, p. 975, 1970. 

Mountain, R. D. Chem. Rubber Co. Crit. Rev . Solid State, 1, p. 5, 1970. 

Zwanzig, R. W. 1961. In Lectures in theoretical physics. Vol. 3. New York: Interscience. 

Neutron Scattering 

Egelstaff, P. A. 1965. Thermal neutron scattering. New York: Academic. 

Larsson, K. E., Dahlborg, V., and Skold, K. 1968. In Simple dense fluids, ed. by H. L. Frisch and 
Z. W. Salsburg. New York: Academic. 

Marshall, W., and Lovesey, S. W. 1971. Theory of thermal neutron scattering. Oxford: Oxford 
University Press. 

SkOld, K., et al. Phys. Rev., 6A, p. 1107, 1972. 
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Memory Functions 
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York: Academic. 

— , Boon, J. P., and Rice, S. A. J. Chem. Phys., 45, p. 1086, 1966. 

, and Harp, G. D. Adv. Chem. Phys., 17, p. 63, 1970. 

Generalized Hydrodynamics 

Ailawadi, N. K., Rahman, A., and Zwanzig, R. W. Phys. Rev., A4, p. 1616, 1971. 

Kadanoff, L. P., and Martin, P. C. Ann. Phys., 24, p. 419, 1963. 
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PROBLEMS 

22-1. Derive Eq. (22-7) from Eq. (22-6). 

22-2. Derive Eq. (22-12). 

22-3. Using the fact that S(x, w) is real, prove that G(— r, t) = G*(r, /). 

22-4. Prove that Eq. (22-24) reduces to Eq. (22-16) when aj—1 and the classical limit 
is used. 

22-5. Derive Eq. (22-25). 

22r-6. Derive Eq. (22-27). 

22-7. Derive Eqs. (22-30) and (22-31). 

22-8. Prove that F s (k, t) can be written in the form 

F,(k, t ) = exp[-K 2 yiO) + K*y 2 (t) H ] 

where 


y.W=i<(r 1 (0-ri(0)) 2 > 

22-9. Derive the non-Gaussian corrections to F,(k, t). 
22-10. Prove that y,(r) [Eq. (22-36)] has the expansion 


yi(f) 


t 2 


2fim 


<*,( 0 ) 2 >^ + 


and that 


S,„c(k, < 1 >) = (47ra)- , ' 2 e-“ 2 ' 4 «{l + 


j3 2 m 2 
4! k 2 


<*i(0) 2 >//4 


(£H 


where a=K 2 /2j9/n and H n (x) — (—l) n e x2,2 (d l, /dx n )e~ x212 is the nth Hermite polynomial. 
22-11. Prove that G,(r, t) can be written in the form 

G,(r, t ) = [4*/>(0]- 3 ' 2 exp J- X [1 + b 6 (t)H b (xr) + • • •] 
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where 


and 


be 


«2 (0 
24 


a 2 


<r 4 > 


3<r 2 > 2 


Rahman* has calculated non-Gaussian corrections to G s (r, t) by molecular dynamics. 
22-12. Define the moments of either S CO i,(x, <*>) or Si„ c (x, w) by 


= I o>"5(x, <o) 

— 00 

Prove that 

<a.°>, nc =l 

<*o°> a *=l+jdre , *‘'g(r) 

K 2 

<fc> 2 > lnc = kT — 


K 2 


<cu 2 >eoh = kT — 
m 


<wA% ° b = frn 



22-13. In 1958, Vineyardf introduced an interesting approximate expression for G d (r, t) 
in terms of G s (r, t) and the equilibrium radial distribution function. First define P( r', t\r, t) 
to be the conditional probability that a different particle is at (r', t) if the first, which had been 
at r = 0 at t = 0, is at (r, t). Show that P( r', f[r, t) satisfies the relations 


limP(r',/|r,r)=^(r') 

f -»0 

!im P(r', f|r, t) = g(r' - r) 


where g is the equilibrium radial distribution function. Show that 


J dr P(r\ /[r, t)G,(r, t) = G,,(r', t) 

Unfortunately, P(r', t |r, t) is not known. Vineyard approximated it by its long-time value 
g( r' — r) to derive the approximation 


J dr g{r' — r)G,(r, t) = G d c ( r', /) 

where the superscript c denotes “ convolution ” approximation for G d . Show that Vineyard’s 
convolution approximation can also be written as 


F d (x, /) = G(x)F.(x, t) 


* Phys. Rev., 136, p. A405, 1964. 
f Phys. Rev., 110, p. 999, 1958. 
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where G(x) is the Fourier transform of g( r). Show that this is equivalent to 
F(x, t) = S(x)F,(x, 0 

where S(x) = 1 -h G(x) is the structure factor. Rahman* has given an interesting molecular 
dynamics discussion of the convolution approximation. 

22-14. Show that 

kT (1 — cos w 0 1 ) 

yi(0= m ^ 

for a one-dimensional harmonic oscillator. 

22-15. Give an example of a stochastic process in which a is a continuous random variable. 
22-16. Prove ParsevaFs theorem. 

22-17. What form does ParsevaFs theorem take on if x(f) is written as a Fourier series: 
00 00 

x(t ) — 2 a « cos <o„ / + 2 sin o>„ t 

n = 0 n = 1 

22-18. Derive Eq. (22-85). 

22-19. Prove that if X t and X 2 are Gaussian random variables; then Y= X t + X 2 is 
also Gaussian. 

22-20. Derive Eq. (22-100). 

22-21. Prove that if Xi(t) and X 2 (t) are Gaussian random variables, then the second 
probability distribution function is given by Eq. (22-102). 

22-22. Derive Eq. (22-184). 

22-23. Calculate the power spectrum associated with a correlation function of the form 
C(r) = e" ,t,/tc 

Plot S(w) versus In w. Notice that S(w) is fairly flat out to w « l/r c , at which point it decreases 
as cu -2 . One calls this flat region a “white” region since it contains all frequencies equally. 
What form of* C(r) would lead to a white spectrum for all frequencies? 

22-24. Prove that if X(t) is a Gaussian random variable, then it can be written as 

x{ t )= i 

n= — oo 

with 

(Cni*C n ) = c,, 2 S nm 

and 

<Ci*C/C**Ci> = o t 2 o k 2 S tJ 8 k i + ofofhuhju 

22-25. Prove that if X(t) is a Gaussian random variable, then 
<x*(t)x(t)x*(0)x(0)> = | <x*(0)*(0)> | 2 + [ 1 2 

22-26. Derive Eq. (22-118). 

22-27. Derive Eq. (22-123). 

22-28. Solve Eq. (22-133). 

22-29. In an electrophoretic light-scattering experiment, prove that if the direction of 
the electric field is perpendicular to the incident radiation, then = k cos (0/2). 

22-30. In an electrophoretic light-scattering experiment, prove that the autocorrelation 
has a period of oscillation given by At = A 0 /uEn sin 0. 


* Phys. Rev., 136, p. A405, 1964. 
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22-31. Show that if the scattering molecules are not independent, then 
A 2 r® 

I(w) = — I e~ llliT F(yt, r) dr 

TT J -co 

where 

F(x, /)= <p(-x, 0)p(x, /)> 

where 

p(x, 0 = 2 expfix • r/0] 

J 

22-32. Derive Eqs. (22-154) and (22-155). 

22-33. Derive Eq. (22-162) from Eq. (22-159). 

22-34. Prove that the ratio of the intensity of the Rayleigh component to the Brillouin 
component is y — 1, where y — C p jC v . This ratio is called the Landau-Placzek ratio. 

22-35. Derive the basic light-scattering formulas starting from Eqs. (J-27) and (J-28). 
22-36. Derive Eq. (22-168) from Eq. (22-167). 

22-37. Derive Eq. (22-175). 

22-38. Prove that 0 2 G = 0G, where 0 is the projection operator defined in Eq. (22-178). 
22-39. Verify the operator identity 

22-40. Prove that 

< a *jS?p> = /<dj3> 

22-41. Show that the inverse Laplace transform of 

K(s) = < U[s - /(I - 0)&] ~ 1 uy 
is 

K(t)= <t7 exp[/Y(l -0)&]Uy 

22-42. Prove that 0 and Jzf, and hence (1 — 0)S£^ are Hermitian operators. 

22-43. Prove that the classical memory function K(t) is a real, even function of time. 
22-44. Derive Eq. (22-194). 

22-45. Prove that a memory function of the form 

m = ye~ 

leads to a correlation function of the form 




5-e s+t ) 


where 


-sH-eri 


22-46. Prove that if the memory function associated with the velocity autocorrelation 
function has the form K(t) = y exp(— a| t|), then 


ki r 

G(a)) = — - dr cos wt 

mD Jq 

= r! 

+ a 2 (l — 2y/a 2 )a> 2 + y 2 
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22-47. Berne, Pechukas, and Harp* have used the ideas of information theory to derive 
an approximate expression for G s (r, t) (see also Problem 22-13). Suppose we know <r 2 (f)>, 
where 

<r 2 (/)>-Jrfrr 2 G s (r, t) 

The information functional associated with the conditional probability distribution G s (r, t) is 
/[<?.( r, /)]=-Jrfr G,(r, t) !n G,(r, t) 

We wish to maximize this information subject to the two constraints 
J dr G,(r, t)= 1 
J dr r 2 G,(r, t) = <r 2 (0> 

Use the method of Lagrange multipliers to show that 
G.(t, t ) = [J7r<r 2 (0>]- 3 ' 2 exp 


which is the well-known Gaussian approximation for G,(r, t). Show that 

<r 2 (/)> = 2 f dr(t- t)<v(0) ■ v(t» 

•*o 

so that G*(r, t) is completely determined under this approximation if <v(0) • v(r)> is known. 

22-48. Starting with Eq. (22-101), show that if v(0 can be considered to be a Gaussian 
random variable, then the joint probability of v(f) and v 0 is 


p(m. 


, Vo) = £ 


m 


2irkT(l - v F 2 (f)) 


] 3 ' 2 I" m ( ! ) 2 + Do 1 -2vvo'F(0)1 
j CXP [ 2 kT (1 - Y 2 (/)) J 


where 'F(z) is the normalized velocity autocorrelation function. 
Defining a transition probability K(v , f|v 0 , 0) through 

P(y(t), Vo) = K(y, t\ Vo , 0)/(v o ) 


where /(v 0 ) is a Maxwell distribution, show that the Gaussian transition probability has the 
form 


K(y, f|v 0 , 0) 


-[ 


m 


lirkm - T 2 (0) 


- 3/2 

; exp 


m (v-v o T-(0) 2 ‘ 
2 kT (1 - Y 2 (0) . 


22-49. Berne, Pechukas, and Harp* have used the ideas of information theory to approxi- 
mate the probability that an atom will have a velocity v at time t given that it had the velocity 
v 0 initially. Denote this conditional probability by J£(v, t \ v 0 , 0). Since the joint probability 
of (v, t) and (v 0 , 0) is 


P(y(t\ Vo) = K(\, 1 1 v 0 , 0)/(v o ) 


where /(v 0 ) is the Maxwell distribution, the information functional that is used is 
/[P] = JrfvJrfvo P(v(/), Vo) In P(v(0, Vo) 


*/. Chem. Phys. t 49, p. 3125, 1968. 
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Show that the maximum of this subject to the two constraints 

JJ dv dv o P(v(t), Vo) = 1 

JJ dvdv „ (V • Vo)mt)Vo) = <Vo 2 mt) 
gives 


•K(v, v 0 , 0) | 2irA . r(1 _ ex p| 2kT (1 - 'F 2 (/» ) 

which is a Gaussian transition probability, i.e., the transition probability that one would 
obtain by assuming that the variable v(f) can be treated as a Gaussian random variable 
(see the previous problem). The next problem discusses a very interesting test of this assump- 
tion using molecular dynamics calculations. 

22-50. Harp and Berne* have presented molecular dynamics calculations of the linear and 
angular momentum autocorrelation functions and have made a thorough analysis of the 
assumption that v(f) can be treated as a Gaussian random variable. If, in fact, v(f) were a 
Gaussian random variable, the joint distribution of v(f) and v(0) would be (cf. Problems 
22-48 and 22-49) 

( m l 3 ' 2 ( m (v 2 + vo 2 - 2v • v 0 

■P[v(0, v(0)] - | 2d( . r(1 _ T 2 ( , )) ] ex P|~ (1 - »F 2 (f )) 

where Tty) = <v(r) • v(0)>/<i; 2 >. The key point here is that a knowledge of Tty) is all that is 
required to calculate any autocorrelation function involving a higher power of v. For example, 
show that the normalized translational kinetic energy autocorrelation function & i(t\ where 

. _ <®W(0> 

^ ( ' ) = - < 8 4 > 

is given in terms of T(r) under the Gaussian approximation as 
M)=Hl + ?V 2 (t)] 

Show also that for &*{t) — <i; 4 (0)f 4 (r)>/<f 8 >. 



M) = — [225 + 600'F 2 ty) + 120T r4 ty)] 


Harp and Berne have evaluated ^ 2 (/), and £*(t) by molecular dynamics and have 
compared these to the Gaussian approximations, and the results for S^(t) and <o 4 (t) are shown 
in Fig. 22-13. It can be seen that the agreement is moderately good. This type of calculation 
is an excellent illustration of the utility and power of molecular dynamics calculations. 

22-51. Prove that 
<v(0)v(0> = Kv(0) • v(0>l 
in an isotropic medium. 

22-52. Derive Eq. (22-236). 

22-53. Derive Eq. (22-242). 

22-54. Derive Eq. (22-244). 

22-55. Zwanzigt has given an interesting relation between the incoherent neutron scat- 


* /. Chem. Phys 49, p. 1249, 1968 
t Phys . Rev., 133, p. A50, 1964. 
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(a) (b) 

Figure 22-13. (a) The autocorrelation function <? 2 (0 from a Molecular dynamics simulation for liquid 
CO and the autocorrelation function & 2 c{t) from the Gaussian approximation, (b) The 
autocorrleation function <f 4 (r) from a molecular dynamics simulation for liquid CO and 
the autocorrelation function <? 4C (0 from the Ganssian approximation. (From G. Harp and 
B. J. Berne, /. Chetn. Phys. 9 49, p. 1249, 1968.) 


tering function 5i„ c (k, w) and the generalized diffusion coefficient D(k, to). 5 in c(k, cu) is given 
by 

5,„ c (k, a>) = ^ J dt e-'»\ P (-k, 0)p(k, 0> 

where 


p( k, /) =■ exp[ik • r(/)] 
First prove that 


a. 2 f dte-'“'< P (-K 0)p(k, f)> = I dt e-‘“‘<p(-k, 0)p(k, t)> 

J -to J -ao 

Now define the generalized diffusion coefficient tensor D(k, to) b\y 
1 r* 

Dab( k, cu) = - dt e~ ita \v b (0)e~ iu ' ri0) v a (t)e iu ' riO y 

3 J o 


and prove that 


1 


Smc(k, oj) = — - k • Re D(k, w) • k 


7TW 


Show that for an isotropic medium 
k 2 D 


5inc(k, to) ”*■ 


as /c ~>0 and as w 


TTOJ" 


0 


in agreement with Eq. (22-42). 

Zwanzig actually does not assume the classical limit and derives the general result 

f Bfi & 1 

5,nc(k ’ W) | w(l - e~^) \ k ' RC ° (k ’ W) ‘ k 
Derive this. 

22-56. Prove that if the memory function associated with the velocity autocorrelation 
function has the form K(t) — ye - * 1 ' 1 , then 

<F, 2 > 


K{0) 


m 2 < vi 2 > 



APPENDIX A 


VALUES OF SOME PHYSICAL 
CONSTANTS AND ENERGY 
CONVERSION FACTORS 


Table A-l . Values of some physical constants 


quantity 

symbol 

value 

Avogadro’s number 

No 

6.0222 x 10 23 

Planck constant 

h 

6.6262 x 10“ 27 erg-sec 


h 

1.0546 x 10“ 27 erg-sec 

Boltzmann constant 

k 

1.3806 x 10“ 16 erg/molecule-deg K 

gas constant 

R 

8.3143 x 10 7 ergs/mole-deg K 

1.9872 cal/mole-deg K 

speed of light 

c 

2.9979 x 10 10 cm/sec 

proton charge 

e 

4.8032 x 10-‘° esu 

electron mass 

trie 

9.1096 X 10~ 28 g 

atomic mass unit 

amu 

1.6605 x 10- 24 g 

Bohr magneton 

Mb 

9.2741 x 10 -21 erg/gauss 

nuclear magneton 

M* 

5.0509 x 10" 24 erg/gauss 


Source: B. N. Taylor, W. H. Parker, and D. N. Langenberg, Rev. Mod. Rhys., 41, p. 375, 1969. 
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APPENDIX B 


FOURIER INTEGRALS AND THE 
DIRAC DELTA FUNCTION 


The Fourier transform g(k) of the function f(x) is defined by 

9 {k) = ( 2 ~yn J_ f( x ) e ~ ik * dx ( B-1 ) 

The Fourier integral theorem states that if f(x) is single valued and periodic with at 
most a finite number of finite discontinuities, maxima, and minima, and 

f \f(x)\ 2 dx 

J — co 

is finite, then 

m = ^17 1 ff(k)e ikx dk (B— 2 ) 

Equations (B-1) and (B-2) are said to be a Fourier transform pair. Other definitions 
of the Fourier transforms are in use, differing by numerical factors and by the use of 
complex conjugates of the integrands given here. 

In vector notation, if V(x) is a function of the vector r = (x, jy, z) and F(k) its 
Fourier transform, then 

OO 

F(k) = (2 W 2 W F(r)e_ik r dr (B_3) 

— OO 

and 

OO 


(B-4) 
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Fourier transforms can be used in a formal manner to obtain a useful representation 
for the Dirac delta function. The Dirac delta function is defined by 


f <5(x — a)€>(x) dx = <E>(a) 
•'-00 

f 5(x) dx = 1 

^ — co 

The Fourier transform of <5(*) is 


tW 2 J 


dx = 


(In) 1 ' 2 (2n) 112 

The formal inverse gives the desired relation: 


(B — 5) 


(B-6) 


(B— 7) 


1 r m 


e iux du 


(B-8) 


More generally, in three dimensions. 


oo 


^ = rl^JJP krA 


(B-9) 


(B — 1 0) 


where <5(k) = S(k x )5(k y )d(k z ). 
A triple integral such as 


oo 


(B — 1 1 ) 


may be converted to a single integral, and hence more easily handled, by converting 
to spherical coordinates: 

dr = dx dy dz -► r 2 sin 6 dr dO d<}> 
k • r -► kr cos 6 
and so 

F(|k,) = C Jo J o 2 V(|r|)e“ i ' 1 — V dr sin 6 d6 d<}> 

The integral over $ gives 2n. The integral over 6 gives 




sin kr 


(B — 1 2) 


In order to evaluate Fourier transforms, the identity e iz = cos z 4- i sin z is often 
useful if one remembers that 


f /(*) dx = 2 [ f{x) dx if f(x) is even 

*’-00 J 0 

J /(*) dx = 0 


if f(x ) is odd 
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For example, consider the Fourier transform of l/(x 2 + a 2 ) 


,oo e ~ ikx 


4>(k) = (2n) 1/2 J ~y ^ dx 


a, X + a 

f coskxdx (n\ m t 

- 2(2n) Jo a ' 


ka 


0 XT + 

You should be able to invert 4>(k) to get (x 2 + a 2 ) -1 . 

PROBLEMS 

B-l. Show that the Fourier transform of f(x) = e~ alxl is 



a 

k 2 + a 2 


B-2. Show that the Fourier transform of e~ x * 2 is (2X)~ 1J2 e~ k2, * x . 
B-3. Solve the equation 


e~ w — cos atf(t)dt 

•'o 

for /( x) given that /( x) is an even function of x. 

B-4. Given that /( r) = Ne~ r2, ' a2 where r — |r[ (three dimensions), show that its Fourier 
transform </>(k) = tf 3 Afe _ * 2fl2 /2 3/2 . 

B-5. The delta function is not a function in the strict sense, but has meaning only if it 
occurs under an integral sign. For example, we can assign a meaning to jc8(x) by multiplying 
by an arbitrary but continuous function /( x) and integrating to get 

oo 

J f(x)x8(x)dx = 0 

—oo 

and thus one often sees jc8(jc) — 0. In a similar way, show that 
x8'(x) = -8(x) 


and 


B-5. Show that 


8(x — a) cos xdx — cos a 

" — 00 

by first evaluating 

r °° 

1(g) = (2tt<t 2 ) “ 1/2 cos xe ~ ix ~ 0)2/2 ® 2 dx 

J -CO 

and then taking the limit of cr ->0. This shows that Eq. (1-72) with cr-*0 is one representa- 
tion of the delta function. When in doubt, this is always a sure way of handling 8(x). 

B-7. Another representation of 8(x) was used in deriving Eq. (10-131). Let u k (x) be an 
orthonormal complete set. Let tp(x) be a suitably arbitrary function which can be expanded 
in terms of the {w*(x)} : 


<P(x) =£ a k u k (x) 
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First find the a k by multiplying by u k *(x) and integrating and then substituting the result back 
into the above equation to get 

4 >(*) = J [z «.*(*>.(*)] ton dx' 

Conclude from this that 

$(x — x') = Y, u k *(x')u k (x) 

k 

Generalize this result to ^-dimensions. 
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Source : K. S. Pitzer, Quantum Chemistry (Englewood Cliffs, N.J.: Prentice-Hall, 1953). 



APPENDIX D 


HARD-SPHERE RADIAL 
DISTRIBUTION FUNCTION 


A listing of a FORTRAN program that gives essentially the exact hard-sphere radial 
distribution function is shown below. It numerically inverts the Laplace transform 
of the Percus-Yevick rg(r) (cf. G. J. Throop and R. J. Bearman, J. Chem. Phys ., 42 , 
p. 2408, 1965) and adds on a correction due to Verlet and Weis (Phys. Rev., A5, 
p. 939, 1972) that brings g(r) into agreement with the computer (Monte Carlo and 
molecular dynamics) results. The author wishes to thank D. Henderson for permis- 
sion to present this program here. 
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APPENDIX D HARD-SPHERE RADIAL DISTRIBUTION FUNCTION 


FUNCTION GR« R HB** 1 > 

CALCul-ATI9N Of GO(R) USING HENnEPSONS PROGRAM 

CBRRpCTISh IS Ma£e TO RiY Oo#R, uSInG VERLET AND WEIS METHbO 

RHB , De-NIST^*SiSmA##3 « R/sIGMA 

COMPLEX EX12* IGi 0 2, IG2 C 2* 102,5, lG2 2 2, IG3 C 2, IG312, IG322* IG33|* 
, IG4 q2, IG4i2* I0A22<IGa32, I q ^ 2 £ lT2*EX2* 

3IXL1CA IXL2Ci IXL3C, IXL4C, IXL5 C, tLTC* jX, ISPC* ISDPC 
REAL LT, LTi*ET2,Lp 
P AT A P I E* K* * r bS/3 . 1 4 1 59265 * 0* 0 . C > 

IF ( K <NE> Cl OB TB 1 
X ■ 2»* p lE/ 3 * 

JX * CEXP(cn e LXtO.O,X) j 
K * 1 

1 IF ( abS(RH3 - RUBS, ,Lt. i.E-j,, g 9 TB 1 C 


RHBS ■ RHB 

ETAC * PlE/6»*RriB 

ETA ^ ETAC*I1. -‘ETAC/U.I 

ETAM 9 1. • eta 

B8T1 p etam** 4 * 

TOP ; 1 , * 2**ETA 
TBPj a tBP*T*P - 
Cl p T9Pi/bSTi 
B8T2 * 4**i3BT1 
T0PP * 2. ♦ Eta 
T8P2 ■ TBPP # TB p P 
C2 k -TBP2/B&T2 
C2 ■ 6.*ETA*C2 
C3 p ETA*Cl/2* 

*1 p o*75*tTA*ETf * t x . 1 ETA*( 0 ,7il7 ♦ 0*11 4 *ETA) )/BBti 
A2 p 24 .*a 1 ?EtA*EtAmp E taP/U, + c»5*ETA> 

A3pC|jBEr ( ETAC/ETA ) 

ETAig . 12 ,#Et* 

i> t i ■ i< « e I*/ 2: 

LT2 , 1 + 2**ETA 
l*P * *-Tl 

XETA ■ l./(X. - ET*) 

ETA3 ■ (1, - ElA)**2 

SP1 , 1. 

SP2 . 4.*ET a *xETA 
SP3 i 6 ..(EI a *XETa )„2 
SDPj p 2« 

SDP2 * S p 2 

stp ; 2. 

FF p (.eta + 3* )*ETA + ?• 

PAR, s CRT(liOT2*0*(ETA* # 2/FF)**5j 

YP'CuSERti.OTpAR) 

YM.CjBERt 1,0-PAR I 
PA'r# C U9ER(2*D»ETA,FF) 

Ti p XETA*(»2,*eIa + pAR*(YP ♦ Y V H 

IT2 , XETa* ( “ 2 » *Et* 4 raRp ( Yp.jX ♦ yVJXj) 

XF 1 . 3.*ET A 3 
PAR a ETA3**2/ETf 
XF2 • PaR»3;/4» 

par a PAR*eT a 3/EIa 
XF3 ; par* 3 */s» 
par ; par*eTa 3 /eIa 

XF4 . PAR*9»732." 

LT a LTl*Tl* LT2 

SDP ; SD p l*Tl '♦ S>DP2 

SP p (SPl.Tl ♦ SP2 >pT 1 ♦ S p 3 

EXll p EXP(-TI) 

XL1 a (LT/3p 1**2 

XL2 i 15*.sD p *#2 - 4 .*«ip,STP 

XL3 . LT ♦ A..Ti*l p 

XL4 , LT*S3 p XS p 

XL5 a 2.*LT * 3.*T1»L D 

5101 ■ T1*LT'XF1/S p 

0201 * -LT/XF2/ SP**? 

G2li p LT*(l! • Il*(2. + SDp/Sp) ) ♦ 2.*Lp*Ti 
G22i p LT.Tl 



o.oo nan 
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G301 ■ LT/X f3 / s P**3 

G311 * LT**2*Tl/ s P**2*( 3 **SDp #*2 * sP*ST p ) • 3.«LT*S0P/SP*( 
I CT * 3.*TI«LP) * 6 .*LP#(LT * l p *T1 ) 

G32l * L T *t £|, *l-P*Tl * lT«( 2 * • 3.*SC D *Ti/s p ) ) 

G331 * g2£1*CT' 

G401 * •LT/X F ''*/SP**fc 
G4ii 


* ? , *T1*XL1*lT/Sp»sDP*(2.*Sp;STP -_3v*SDP**8)_4 XLJ< 

L p *XL5))*LT 


1 XL2*XL3 • 2*.«L h *Xl**xl5 + 12.*L p **2*(3.*LT * 2.*Tl*L p 


G48l * (Tl*XLl«XL 2 . 1 2 .*(XL4 *xL3 

G431 p(» 6.*T1«XL* ♦ 3.#xL3>*LT**P 

G44J ■ G33l"*l.T 

ILTC » U T 1*1I2 ♦ LT2 
JSDPr p SDPl*JT2 ♦ SDp? 

ISPC * (SP1*IT2 * SP2j,|T2 * S d 3 
EX18 « CExp<-tT2> 

IXLlc • t ILlP/lSj*C ) *«2 

IXL2C ■ 15.*ISDpC** 2 • 4,*JS p C*SrP 

IXL3C • ILT C ♦ 4**lT2*iP 

1XLj»C ■ iltc*isdPc/ispc 

IXL5C ■ 2.*ILTC ♦ 3.«lTg*LP 

IGlOg • lT2*iLTC^X p l/ jspC 

tsifi : WcW'.'Vi !<|. ♦ IsDpC/ISpC)) ♦ 2«*L p * IT? 
ieic I ■ Iltc^xfI/jSpc*,? 

l?3l2 • lLT c# *2* 1 T2/IS p C** 2 *<3.*I SDpC **2 • lS p C*STP> • 3 .*|LtC* 

{ ISDpC/ISPC*! IL tC ♦ 3.*IT2*L p ) * 6**L p *tJLTC ♦ LP»IT2> 

I?32p ■ lLTC*(6.*|, p *lTg * lLTC*(P* • 3**ISD p C*IT2/IS p C)) 

IG332 « 1G222»Il j[C 
IP402 . -ILTC/XF4/ISPC..4 

IG4i 2 ■ 5.*n2*lXLiC*ILTC/IS p C*ISDPC*(2»*l9 p C*STP • 3**ISppC**2> 

1 ♦ lxLlC*IXLeC*I*L3C i '>4,*LP*tXL 1 »CplXL5e ♦ l2.*LP**?*(3.*lLTC 

2 ♦ 2,*IT2*lH 

IG422 • UT2*IXL1C*IXL2C • 12. *( IXL<*C*IXL3C - Lp*!XL5C > )*I|_TC 
1G*3? * t-6?*lT2*lXL4C + 3»*Ixr 3C)*ILTC*#2 
• IQ332#ILJC 


lC R " Rl*A3 

IF (R •GE« *9999999 • AVD* R *LE* 5* ) GB TB 1010 
IF (q «GT« 5* )Ge TB loll 
R3 9 R#R#R 

GR 9 Cl ♦ C^*R ♦ C3 # R3 
return 


lCU GR 9 1 • 

RETURN 

iQlC RR G . 0* 

FIRST shell CbNTRjButibn 


EX1 f Exll*EXPtR*Tl) 
EX2 . EX12*CEXP(R*lT2| 
RRG . RRG ♦ G101 *eXj 4. 
IF(R »LE« 2»>G6 Ib 99 


2«*lGlop*E x2 


SECBnD shell contribution 


8 

c 


8 


EX1 , EXl.EXll 
EX2 . EX2*EXl2 

RRG p RRG ♦'G201 *eX 1*(S211 + G?2l*R> ♦ 2,*lG202*EX2*< 13212 ♦ 
1 IG222*R) 

IF (R »LE* 3»>GB Is 99 

THJR6 SHELL CBNTHJBUTIBN 


RX p R . 3. 

EXl , EXUEXll 
EX2 . EX2*EXl2 

RRG p RRG ♦ G301* E Xl* ( ( e 33l*Rx ♦ G32l)*Rx ♦ G3U> ♦ 2**IG303* 
1 Ex2*C(JG332*rX ♦ Ig 322J*RX + »G3l2) 

IF (p »LE. *!>Ge TO 99 

FOURTH shell C0NTrIBUti6N 
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RX i R • 4 , 

Exi ; exi*exu 

ex? , EX2*EXl? 

RRQ p RRG ♦ Q4C1 # e x 1*( ( <G4*1*R* ♦ G*31)*rX ♦ G42l>*RX ♦ Q 4 HI ♦ 
1 2t#EX2*IQ4C2 #( (<jG 442* rX ♦ IS*3 ?>>*rX ♦ tG*22)*rX ♦ jG412) 

9S Z w a2#(R1 - 1*?) 

GR P RRG/R ♦ Ai/Ri#EXP(-z> #c ®S}Z) 

RETURN 

END 


GR 

GR 

GR 

OR 

GR 

GR 


FUNCTION CU^ER(X) 


iecx.lt. C. c> T e 

CUBEp«X**(l* D /3.9) 
RETURN 
lc AX**y 

AA«a5##{1. 0 /3.C) 

CUbEp«-AA 

Return 

end 


lc 



APPENDIX E 


TABLES FOR THE m-6-8 POTENTIAL 


The second virial coefficient, collision integrals, and Chapman-Enskog correction 
factors for the m- 6-8 potential. 


u(r) 

£ 


STT6 (6 + ^ 



— L_ 8)] (^) 


where £ is the depth of the well, o is the distance at which u = 0, i.e., u(a) = 0, r m is 
the distance at which u is a minimum, i.e., 1/(0 = — £, and d = rja. In the tables, 
m = 11, y = 3, and T* = kT/e. The author wishes to thank Howard Hanley for per- 
mission to present these tables here. 
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Second virial coefficients for the 1 1-6-8 (y = 3.0) potential 


T * 

B* 

0.60000 

-4.87068 

0.65000 

-4.17866 

0.70000 

-3.63207 

0.75000 

-3.19016 

0.80000 

-2.82595 

0.85000 

-2.52090 

0.90000 

-2.26188 

0.95000 

-2.03934 

1.00000 

-1.84619 

1.10000 

1.52774 

1.20000 

- 1.27637 

1.30000 

-1.07314 

1.40000 

-0.90561 

1.50000 

0.76526 

1.60000 

-0.64607 

1.70000 

-0.54368 

1.80000 

-0.45482 

1.90000 

-0.37705 

2.00000 

0.30844 

2.20000 

-0.19307 

2.40000 

-0.10001 

2.60000 

-0.02353 

2.80000 

0.04033 

3.00000 

0.09434 

3.20000 

0.14054 

3.40000 

0.18044 

3.60000 

0.21519 

3.80000 

0.24568 

4.00000 

0.27260 

4.50000 

0.32768 

5.00000 

0.36984 

5.50000 

0.40287 

6.00000 

0.42925 

6.50000 

0.45064 

7.00000 

0.46820 

7.50000 

0.48277 

8.00000 

0.49495 

8.50000 

0.50522 

9.00000 

0.51392 

9.50000 

0.52134 

10.00000 

0.52768 

12.00000 

0.54528 

14.00000 

0.55495 

16.00000 

0.56008 

18.00000 

0.56244 

20.00000 

0.56304 

26.00000 

0.55939 

30.00000 

0.55484 
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Collision Integrals for the 1 1-6-8 (y = 3.0) potential 


T* 

ft* 1 -” 4 




ft< 2 ’ 3 >* 

ft< 3 ’ 3 >* 

0.6000 

1.77495683 

1.50044068 

1.97293436 

1.31799182 

1.75119807 

1.62664569 

0.6500 

1.71029244 

1.44430297 

1.90393332 

1.27150018 

1.68427961 

1.56699361 

0.7000 

1.65268780 

1.39511086 

1.83999062 

1.23134857 

1.62310690 

1.51434932 

0.7500 

1.59908326 

1.34980755 

1.77844390 

1.19570368 

1.56853486 

1.46629105 

0.8000 

1.55313888 

1.31078609 

1.72505031 

1.16477532 

1.52060206 

1.42331600 

0.8500 

1.50943774 

1.27685162 

1.67759146 

1.13794402 

1.47708942 

1.38516896 

0.9000 

1.46922464 

1.24690235 

1.63261115 

1.11429982 

1.43791883 

1.35111649 

0.9500 

1.43430228 

1.21885599 

1.59074373 

1.09308629 

1.40277761 

1.32011258 

1.0000 

1.40190117 

1.19315885 

1.55245548 

1.07387989 

1.37099892 

1.29166006 

1.1000 

1.34357947 

1.14923890 

1.48592018 

1.04096448 

1.31621990 

1.24211144 

1.2000 

1.29400805 

1.11350836 

1.42835730 

1.01376097 

1.27020089 

1.20067018 

1.3000 

1.25350469 

1.08291541 

1.37936665 

0.99076132 

1.23131087 

1.16550068 

1.4000 

1.21753570 

1.05644518 

1.33706527 

0.97101082 

1.19832065 

1.13518176 

1.5000 

1.18421312 

1.03359443 

1.30037160 

0.95379473 

1.16989256 

1.10879871 

1.6000 

1.15504628 

1.01369579 

1.26748280 

0.93866542 

1.14515079 

1.08571844 

1.7000 

1.12997253 

0.99605801 

1.23842116 

0.92523500 

1.12343252 

1.06539195 

1.8000 

1.10801605 

0.98032225 

1.21281020 

0.91321359 

1.10414466 

1.04735142 

1.9000 

1.08807262 

0.96617115 

1.19000803 

0.90233838 

1.08693482 

1.03119717 

2.0000 

1.06958556 

0.95330369 

1.16943369 

0.89242579 

1.07150251 

1.01661888 

2.2000 

1.03769859 

0.93098908 

1.13394880 

0.87496953 

1.04494356 

0.99132506 

2.4000 

1.01099595 

0.91222000 

1.10430496 

0.86001408 

1.02282001 

0.97012749 

2.6000 

0.98812353 

0.89605171 

1.07911862 

0.84700946 

1.00399848 

0.95198017 

2.8000 

0.96833164 

0.88188385 

1.05760250 

0.83545818 

0.98777679 

0.93639700 

3.0000 

0.95078230 

0.86935931 

1.03897395 

0.82511265 

0.97355794 

0.92267975 

3.2000 

0.93533311 

0.85817805 

1.02262673 

0.81576952 

0.96097407 

0.91051120 

3.4000 

0.92165145 

0.84808145 

1.00809869 

0.80725371 

0.94971433 

0.89959636 

3.6000 

0.90944407 

0.83888754 

0.99507016 

0.79943579 

0.93955025 

0.88972610 

3.8000 

0.89830455 

0.83048268 

0.98332895 

0.79220960 

0.93029732 

0.88074277 

4.0000 

0.88805170 

0.82273302 

0.97266670 

0.78549248 

0.92181381 

0.87251447 

4.5000 

0.86583179 

0.80570453 

0.94972898 

0.77063955 

0.90331730 

0.85454060 

5.0000 

0.84748180 

0.79123150 

0.93087726 

0.75764824 

0.88775363 

0.83942074 

5.5000 

0.83178498 

0.77866684 

0.91494074 

0.74632705 

0.87436348 

0.82639695 

6.0000 

0.81824763 

0.76758447 

0.90118200 

0.73623864 

0.86261 104 

0.81497356 

6.5000 

0.80636877 

0.75767422 

0.88909357 

0.72714416 

0.85214217 

0.80481054 

7.0000 

0.79575232 

0.74872313 

0.87833881 

0.71886917 

0.84270646 

0.79565537 

7.5000 

0.78617885 

0.74056466 

0.86866131 

0.71128089 

0.83411929 

0.78732605 

8.0000 

0.77746627 

0.73306709 

0.85986201 

0.70427535 

0.82623927 

0.77968899 

8.5000 

0.76948072 

0.72613592 

0.85180140 

0.69777133 

0.81895853 

0.77263782 

9.0000 

0.76211483 

0.71969252 

0.84436560 

0.69170305 

0.81219147 

0.76608977 

9.5000 

0.75528526 

0.71367522 

0.83746685 

0.68601729 

0.80587034 

0.75997739 

10.0000 

0.74893100 

0.70803109 

0.83103386 

0.68066998 

0.79993999 

0.75424598 

12.0000 

0.72717121 

0.68842664 

0.80892213 

0.66200350 

0.77931652 

0.73434495 

14.0000 

0.70961693 

0.67234483 

0.79097716 

0.64660102 

0.76234894 

0.71800629 

16.0000 

0.69491929 

0.65873431 

0.77587443 

0.63351590 

0.74794007 

0.70415536 

18.0000 

0.68228800 

0.64695238 

0.76283719 

0.62216028 

0.73542548 

0.69214297 

20.0000 

0.67122838 

0.63657807 

0.75137094 

0.61214403 

0.72437144 

0.68154551 

26.0000 

0.64454920 

0.61136970 

0.72350146 

0.58775833 

0.69736849 

0.65570675 

30.0000 

0.63045621 

0.59797285 

0.70865817 

0.57478102 

0.68293315 

0.64192048 
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Collision integral combinations for the 1 1-6-8 (y — 3.0) potential 


T * 

A 

fx 

Id 

0.600 

1.0002 

1.0002 

1.0001 

0.650 

1.0001 

1.0001 

1.0001 

0.700 

1.0000 

1.0001 

1.0001 

0.750 

1.0000 

1.0001 

1.0001 

0.800 

1.0000 

1.0001 

1.0001 

0.850 

1.0000 

1.0000 

1.0001 

0.900 

1.0000 

1.0001 

1.0001 

0.950 

1.0000 

1.0001 

1.0002 

1.000 

1.0001 

1.0001 

1.0002 

1.100 

1.0001 

1.0002 

1.0003 

1.200 

1.0002 

1.0003 

1.0005 

1.300 

1.0003 

1.0005 

1.0006 

1.400 

1.0004 

1.0007 

1.0007 

1.500 

1.0006 

1.0009 

1.0010 

1.600 

1.0008 

1.0012 

1.0012 

1.700 

1.0010 

1.0016 

1.0015 

1.800 

1.0012 

1.0019 

1.0017 

1.900 

1.0014 

1.0022 

1.0019 

2.000 

1.0017 

1.0026 

1.0021 

2.200 

1.0021 

1.0033 

1.0026 

2.400 

1.0026 

1.0040 

1.0030 

2.600 

1.0030 

1.0047 

1.0034 

2.800 

1.0034 

1.0053 

1.0038 

3.000 

1.0038 

1.0059 

1.0041 

3.200 

1.0041 

1.0064 

1.0044 

3.400 

1.0044 

1.0069 

1.0047 

3.600 

1.0047 

1.0073 

1.0050 

3.800 

1.0049 

1.0077 

1.0052 

4.000 

1.0052 

1.0081 

1.0054 

4.500 

1.0057 

1.0088 

1.0059 

5.000 

1.0061 

1.0094 

1.0063 

5.500 

1.0064 

1.0099 

1.0066 

6.000 

1.0066 

1.0103 

1.0069 

6.500 

1.0068 

1.0106 

1.0071 

7.000 

1.0070 

1.0109 

1.0072 

7.500 

1.0071 

1.0111 

1.0074 

8.000 

1.0072 

1.0112 

1.0075 

8.500 

1.0073 

1.0114 

1.0076 

9.000 

1.0074 

1.0115 

1.0077 

9.500 

1.0074 

1.0116 

1.0078 

10.000 

1.0075 

1.0117 

1.0078 

12.000 

1.0077 

1.0119 

1.0080 

14.000 

1.0077 

1.0120 

1.0081 

16.000 

1.0078 

1.0121 

1.0082 

18.000 

1.0078 

1.0121 

1.0082 

20.000 

1.0078 

1.0121 

1.0082 





APPENDIX F 


DERIVATION OF THE GOLDEN RULE 
OF PERTURBATION THEORY 


Consider a system whose Hamiltonian can be written in the form 

^=^ o + AF(0 (F— 1 ) 

where A is small, 0 is independent of time, and V(t) is a function of time and the 
spatial coordinates of the system. The system obeys the time-dependent Schrodinger 
equation 

ih — = Jf'i* (F— 2) 


The eigenfunctions ij/ m and eigenvalues E m of are given by 

JP 0 ip m =E m ij, m (F-3) 

We assume that these are known and that the eigenfunctions form a complete ortho- 
normal set. 

Now if #£ in Eq. (F-2) were independent of time, then *F(f) = would 

be a solution. This suggests that we expand ¥(f) as 

*(0 = I (F— 4) 

m 

Substitute this into Eq. (F-2) to get 

itl + E a m ijj n E m e~ iEm,,h 

m a* m 

= 'Z a m'l'm E m +^Y. Va m>Pm e~ iB ” flh 

m m 


Now multiply by ij/ k * 9 integrate over all space, and use the orthonormal property of 
the ij/j to get 




(F— 5) 
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where is the kn matrix element of V(t), i.e., 

(F— 6) 

and tio kn =E k -E n . 

The set of equations for the a k (t) 9 Eq. (F-5), is equivalent to the time-dependent 
Schrodinger equation, Eq. (F-2). This set of equations can be solved by successive 
approximation by writing 


*„= £ A v° 

i= 0 

The substitution of this into Eq. (F-5) gives to linear powers in X 
0 n (O) = const 

and 


(F— 7) 


(F-8) 


(F-9) 


Suppose now that initially the system is known to be in the stationary state \j / m , 
so that ¥(0) = ip m , and that the time-dependent perturbation kV(t) is independent 
of time except for being turned on at time 0. Then o t (0) = 5 km and Eq. (F-9) becomes 

ih ~ t > 0 (F— 1 0) 


or 


(i) 



H tm (1) e ,w dt' 


H km (l) - 1 
h a> km 


k ^ m 


Now according to Eq. (F-4) (assuming T normalized). 


(F-11) 


r^mo^= £ i a m(oi 2 = i 

J w=0 

and so \a m (t)\ 2 has the interpretation of being the probability that the system is in 
the state m at time t . To first order in X then 


|fl* (1) | 2 = 4|f/ ra * <1 >| 2 


sin 2 \a> kn t 

fc 2 «w 2 


( F— 1 2) 


This is the (first-order) probability of transition from state m to state k^m. This 
probability is shown as a function of E k in Fig. F-l. As t increases, the height of the 
central portion of the curve in Fig. F-l increases as t 2 and its width decreases as l/t. 
The other peaks simply oscillate in time. Thus the most likely transitions are those to 
states whose energies lie under the central peak in Fig. F-l. Since this peak is given 
by the first zeros of sin jc, then x = ±n, and so the energies of the most probable states 
satisfy 
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Figure F-l. A plot of sin 2 ^w km tlw km 2 versns co kmj showing how the central peak grows as t 2 and 
narrows as 1 It. 


Suppose now that there is actually a continuum of states about the state k. Then 
we would wish to know the probability that there is a transition into this group of 
states, rather than to just the one state k. If we assume that |/7 fcro (1) | 2 is a smooth, 
slowly varying function of k and let p(E k ) be the density of states about E k , then the 
probability of making a transition from m to a state about k is' 


P=\H km w 



sin(qW/2) ~| 2 

Km/2) J 


(F— 1 4) 


where the asterisk indicates that we are integrating only over a small region around 
E k . Now for large enough t> the central peak must include all the states around E k . 
If we assume that p(E k ) is not a strong function of E k , we can take p(E k ) out from 
under the integral sign to get 


P=\H k J l Y 


f r sin tom*/2)' 


dE L 


(F-15) 


But for large t the area under the central peak is essentially all the area, so we can 
extend the limits here to + oo and use the fact that 


J 


00 sin 2 * 


dx =n 


to get finally 
P = Tt 

where T, the transition rate, is given by 
T = 2 {\H km ^p{E k ) 


(F— 1 6) 


(F— 1 7) 
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where \E m — E k \ < 2nh/t. This result is known as FermVs golden rule and is a well- 
known famous result of first-order time-dependent perturbation theory. 

A similar result can be obtained by noticing that as t becomes large, [sin(co km tj 2)/ 
( co k J2 )] 2 becomes more and more peaked about E k = E m , has total area 2nt/h, and 
hence approaches (2nt/h) S(E k — E„) y except for the negligible oscillations in the 
wings. Thus we again have 

P = Tt 

with 

r = ~\H k J l YS(E k ~EJ (F— 1 8) 

Remember that to use this formula numerically, we must sum P over a continuous 
group of final states. Baym (cf. “ References ”) gives a thorough discussion of the 
limit of the validity of the golden rule. 

In deriving the golden rule, we have assumed that the perturbation is turned on 
suddenly at t =0. It is often more convenient to “turn on” the perturbation very 
slowly. This can be done by turning on the perturbation 

V(t)=e*V 0 <t 

= V t> 0 


where rj > 0 in the infinite past, and then letting r\ 0 in the final results. Since the 
derivation of the golden rule requires that t be large, the result must be independent 
of how the perturbation was initially introduced. 

In Section 21-1, 


XV(t) = —E 0 M • € cos cot 

= - E°™' e (<■••“" + e~ ia ‘) 

where E 0 is the amplitude of an electric field of frequency a>, £ is a unit vector along 
its direction, and M is the dipole moment of the system. It is straightforward to 
show in this case that 


fl<‘> = - 


E 0 if |M • £| i> - 1 _ i 


2 h 




and 


co fi — CO 


COfi 


U))f _ J - 

+ C0 


sin 2 i(c o fi — co)t sin 2 \{co fi -I- c o)t 


8 cos cot sin ^(co + co /£ )f sin o — co /£ )n 
(co - co r j)(co + (Oft) J 


and that 




nE 0 2 
2 h 2 


| </|s • M|i>| 2 [^(co /J - co) + 6((o fi + co)] 


(F— 1 9) 
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APPENDIX G 


THE DIRAC BRA AND KET 
NOTATION 


It is common in quantum-mechanical derivations to have integrals such as 
\'¥*'¥ m dq or J dq 

where M is some operator. It is convenient to define a scalar or inner product (a, b) 
of two functions a and b by 

(a, b) = J a*b dq (G-1 ) 

where we assume that both a and b are suitably well behaved over the range of inte- 
gration. In this notation, the above two integrals are (¥„, ¥ m ) and (¥„, 

If A is a scalar (possibly complex), the inner product has the easily proved properties 
(Problem G-1) 


('E„,'PJ* = (T m ,'P n ) 

(G— 2) 

('P„,A'E ra ) = ACP„,'PJ 

(G— 3) 

W.. 'TJ=A*(T n , VJ 

(G— 4) 


The inner product defined here is similar to the inner or dot product of vectors. 
Consider two three-dimensional vectors, a and b 

a = a x z x + a y z y + a z z z 

■ w , . , . (G— 5) 

D + UyZy + D z Zg 

Since and e z form an orthonormal set, i.e., s t • = 5 y , we see that 

a-b = a x b x + a y b y + a z b, (G-6) 

If we extend this to an A-dimensional space, for example, still with s t ■ &j = 3 tj , we 
get 

a • b = £ djbj 

i= i 


(G— 7) 
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It is possible to extend this further to an infinitely dimensional function space. 
Instead of a simple dot product, we use the above definition of an inner product as 
as an integral. Thus consider a complete orthonormal set {<f>j(x)}. By orthonormal we 
mean that 


J dx = dij 


and by complete we mean that this set spans the space ; i.e., there are no other functions 
orthonormal to the set {</>/*)}. This means that all functions defined in the “ space ” 
spanned by the set {$/*)} can be expanded as 

/(*) = (G-8) 

Note the analogy with Eq. (G-5). If h(x) = £ h } 4>pc), then their inner product is 

(/,(*),/(*)) = £ h t *fj f dx 

i,j J 

IV/A= E hj% (G-9) 

i.j J 

in analogy with Eq. (G-7). Just as the length of a three-dimensional vector is given by 
|a| = (a • a ) 1/2 = (a 2 + a 2 + a 2 ) 1 ' 2 
the “length” of /(x) is given by 

ll/WII = (f, f) vl = ( £/,*/,-) ‘^ (G-1 0) 

Similarly, just as two three-dimensional vectors are orthogonal (perpendicular) if 
their dot product is zero, we say that the two functions h(x) and f(x) are orthogonal 
if their inner product vanishes. These ideas are a brief introduction to the beautiful 
mathematical concepts of vector space, Hilbert space, etc. The works by Friedman, 
Merzbacher, and Jackson (“ References ”) discuss these in more detail. 

We are now ready to discuss an elegant notation due to Dirac. The state of a system 
is described by a wave function, which may be considered to be a vector in some space. 
Hence we call the wave function a state vector and represent it by | m>. The set of all 
state vectors forms a vector space. The set of all possible prefactors (jn\ forms another 
vector space, said to be the dual space. We set up a set of rules for operations in these 
two spaces such as 

k>+ |6>« K^l +<fc| (G— 1 1 ) 

A | a>< (G-1 2) 

The two spaces are connected by defining the product of a prefactor with a state 
vector by 


{a\by=Q¥ a ,^ b ) (G— 1 3) 

Since this represents a bracket notation for an inner product, the quantity <c | is called 
a bra vector and | by is called a ket vector. The two vector spaces are said to be dual 
vector spaces. Using this definition, it is easy to show that (Problem G-3) 

<a\by = <b\ay* (G-1 4) 

(alkby =A<fl|fc> (G-1 5) 

iXa\by=k*ia\by (G-1 6) 

This last line shows the motivation behind the definition in Eq. (G-1 2). 
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We can define an operator in the state space through 
I b>=A\a> 

If it so happens that 

A\a}=a\dy (G— 1 7) 

we say that | a) and a are an eigenfunction and an eigenvalue, respectively, of A. 
We can also introduce a set of orthonormal vectors, |/77>, that span the state space and 
expand a state vector | a) by (Problem G-4) 

|o>= £ <m|fl>|/w> (G— 1 8) 

m 

Note the analogy of this equation with Eq. (G-8), where fj = J dx. 

Since < [m\a > is a scalar, Eq. (G-18) can be written as 

\ a > = L l m > < m \ a > (G-19) 

m 

which shows that \m) (jm\ can be thought of as an operator that projects | a) onto 
| ra>. Equation (G-19) also shows that 

£ |m> <m| = 1 (G— 20) 

m 

which is called the completeness or closure relation. Generally the combination 
|fl> <fc| is an operator since it operates on a ket to give another ket. 
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PROBLEMS 
G-l. Prove that 

Q¥ ny '¥ m )* = ('¥ m ,'¥ n ) 

0F„, AY J = 

(AT n ,T m ) = A*('F n ,'F m ) 


G-2. Discuss the analogy between a simple three-dimensional vector space spanned by 
e x , B yt e- and the space of all functions that can be expanded in terms of (sin mx}. 

G-3. Prove that 

<b\a>* = <a\b> 

<a|A£> = A <i a\by 
<Aff|6> = A* <a\b> 

Compare these equations to those in Problem G-l. 

G-4. Derive Eq. (G-18). 

G-5. Write out Eq. (G-20) in terms of functions l F fI and integrals; i.e., what does this 
equation look like if one does not use Dirac notation? 



APPENDIX H 


THE HEISENBERG TIME-DEPENDENT 
REPRESENTATION 


Before discussing the title of this appendix, we must first introduce a number of 
definitions and notions of operators and functions of operators. Let A and B be 
operators and c x and c 2 be (possibly complex) scalars. Then we have 


(c x A +c 2 B)'¥ = c i A'¥ + C 2 BV (H-1) 

The successive application of two operators is defined through 

AB'¥=A(B't 0 (H — 2) 

Generally the order of application is important since often AB ^ BA. We define the 
commutator of A and B by 

[A, B] = (AB — BA) (H— 3) 

We can readily construct a polynomial function of an operator A by forming 

P n (A) = c 0 + c x A +c 2 A 2 + - • • + c n A n (H— 4) 

By considering the limit N -+ oo, we can define more general functions of A through 
power series. For example, 

+A+ h A,+ h A>+ - (H - 6) 

Note that (Problem H-1) 

e A+B *e A e B if [A,B]* 0 


If A*¥ = then it is clear that A*"¥ = a n% ¥ and f(AY¥ = f(dy¥ if /is an analytic 
function. 

There are a number of special kinds of operators. If (<£, A*¥) = ( A(j>, ¥), then A is 
said to be a Hermitian or self-adjoint operator. It is easy to prove that a Hermitian 
operator has real eigenvalues (Problem H-2). For this reason, the quantum-mechanical 
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operators that correspond to physical, observable properties are Hermitian. Generally 
an operator will not be self-adjoint. We define an operator called the adjoint of A , 
by 

(4>,A'¥) = (A'<f>,'¥) (H— 6) 

Clearly (/4 t ) t = A (Problem H-3). If A is Hermitian, A = A? It is also easy to prove 
that (Problem H-4) 

(c t A + c 2 By = Ci* A * + c 2 *B t 

and 

(AB) f = BU' 

A unitary operator is defined by 

UU' = U'U = I (H— 9) 


(H— 7) 
(H— 8) 


where / is the unit operator. A unitary operator corresponds to a rotation in a vector 
space. This is proved in the following manner: The length of a vector is (</>, 4>) l/2 - 
But (<p, <j>) = (4>, U r U(j>) = (U<j>, U<f>), which implies that \\(p\\ = || U<}>\\, where ||^|| 
denotes the length of (p. Also, it is easy to show that if (<p, *P) = 0, then ( U<p , U*¥) = 0, 
and so we see that the operation U must correspond to a rotation. 

If A is Hermitian and f(A) is a real function, then (Problem H-5) U = exp[i/(^4)] is 
a unitary operator. From I/ t I7 = UU* = /, it follows that if 'F = U<f>, then <j> = U^. 
Thus for a unitary operator, the adjoint C/ t has the property if being an inverse opera- 
tion as well, so that 

(H— i o) 

If an operator depends continuously on a continuous parameter t, then the deriva- 
tive of A is defined by 


dA 

dt 


,. A(t + e) - A(t ) 
lim 

E-»0 £ 


Problem H-6 has you prove that 


^b>.§ b + a 


dB 

dt 


and 


— - e iAt = iAe Ut 
dt 


(H— 1 1 ) 


(H — 1 2) 


(H — T 3) 


We are now ready to introduce the Heisenberg picture of time-dependent quantum 
mechanics. There are a number of different so-called representations of quantum 
mechanics. The two most common are probably the Schrodinger representation and the 
Heisenberg representation. In the Schrodinger picture, the temporal variation of a 
system is given by time-dependent wave functions, and the operators corresponding 
to observables are independent of time. In the Heisenberg picture the states of the 
system are represented by time-independent wave functions, and the operators are 
time dependent. 

Since the physically significant quantities of quantum mechanics are inner products, 
we require that 

/ Xo*A(ty¥ 0 dq = \ x^A^it) dq 


(H-14) 
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The first is the Heisenberg representation, and the second is the Schrodinger repre- 
sentation. Now for a time-independent Hamiltonian, the time-dependent Schrodinger 
equation 


ih — = MTV 
dt 

can be solved formally by 

y(r) = C/(0^(0) 


where 


17(0 = exp 


(-?) 


(H— 15) 


(H-1 6) 


(H— 17) 


This operator is defined through its power series. 

Substitute this into Eq. (H-1 4) to get 

(X o , 'f(O'J'o) = W)Xo , AoUWo) = (xo , V;A 0 U.'Vo) 

= (Xo ,U*A 0 U t ^ 0 ) (H-1 8) 

from which we see that 

A(t ) = U,*A 0 U, = e iJt " h A 0 (H-1 9) 

This gives the time dependence of the operator A(t). We can differentiate this to get 

ih ^ =it,dJ r A ° Ui+ihu * A ^ 


= -jru*A 0 u t + u*a 0 u t #e 

= -rfA t + A t 3fe = [A t ,MT] 


Thus, 


dA 

ih — = [A t ,jr] 


(H-20) 


where we have assumed that A 0 has no explicit time dependence. This equation is 
called the Heisenberg equation of motion. We have shown that Eq. (H-1 9) is a solu- 
tion to Eq. (H-20). 
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PROBLEMS 

H-1. Prove that * e A e B if [A, B] ± 0. 

H-2. Prove that the eigenvalues of a Hermitian operator are real. 

H-3. Prove that (A*)' = A and that (A- 1 )- 1 = A. 

H-4. Prove that (AB)* = £ f A* and that ( AE)~ X = B- l A~K 

H-5. Prove that if A is Hermitian and / is a real function, then U = exp[z/04)] is unitary. 
H-6. Prove that 

d{AE) dA dB 

-h- L = -r B ^-A— 
dt dt dt 


and 


de tAt 

— — = iAe 
dt 
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THE POYNTING FLUX VECTOR 


In this appendix, we shall introduce the concept of the Poynting flux vector and then 
derive Eq. (21-16). Consider a single particle of charge q moving with velocity v. 
The rate of work done by an external electromagnetic field is q\ • E. There is no con- 
tribution here from the magnetic part B since the magnetic force is perpendicular to 
v. For a continuous distribution of charge and current, then the total rate of work 
on the volume V is 


i 


J E dr 


(M) 


where J is the macroscopic current density. 
Now recall that Maxwell’s equations are 


div D = 4np 


div B = 0 


m 4 tt _ 1 dD 

curl H = — J H — 

c c ct 


ir 1 SB ^ 

curl Eh — = 0 

c ot 


d-2) 


with the constitutive relations D = eE, J = oE, and B = pH for an isotropic, con- 
ducting medium. 

We use the Maxwell equation for curl H to eliminate J in Eq. (I— 1) to get 


l 3 E *=st[‘*-< VxH >- E -?]* 

We now use the vector identity 

V • (E x H) = H • (V x E) - E • (V x H) 
and the Maxwell equation for curl E (Faraday’s law) to get 
r 1 r T dD dBl 

-l K J ' E,ir “4;lr (E><H) + E "a +H 'aFj‘ ir 


(1-3) 


(1-4) 


(1-5) 



APPENDIX I THE POYNTING FLUX VECTOR 619 


If we assume that the medium is linear, i.e., that e and /i are constants, then we can 
express the time derivatives in Eq. (1-5) in a more conventional form. Using the fact 
that D = eE and B = /iH, we can write 


1 

471 


(-f-f) 


lifl, 

8 tt dt™ 


D + B H) 


and if we identify (E • D + B • H)/87r as the energy density u, Eq. (1-5) can be written 
in the form 


and since the volume V is arbitrary, this can be written as 


d-6) 


^ + VS= - J E 
dt 


(1-7) 


This is the (energy) continuity equation. By comparing this with the usual continuity 
equations, we see that the vector S represents the flux of energy and is called the 
Poynting vector 


S= — (ExH) 

471 

and that the quantity 

«=^-(E D + H B) 

o7T 


( 1 - 8 ) 


(1-9) 


can, in fact, be identified as the energy density of the electromagnetic field. Equation 
(1-9) can also be derived more directly by considering the work done by electric and 
magnetic fields separately, but the derivation here is quite valid. 

We now wish to derive Eq. (21-16) from Eq. (1—8). Maxwell’s equations for a 
nonconducting medium with no free charges are 


div E = 0 


1T1 1 dB ^ 

curl Eh — =0 

c dt 


us dE 

div B = 0 curl B — = 0 

c dt 


( 1 - 10 ) 


It is a standard procedure to show from these equations that each Cartesian compo- 
nent of E and B satisfies the wave equation (Problem 1-1) 


v 2 c = 


v 2 dt 2 


( 1 - 11 ) 


where £ represents any of the components of E and B and 


v 


M 1/2 


(1-12) 
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Clearly v is the velocity of propagation of the electromagnetic wave through the me- 
dium characterized by e and fi. The solutions to Eq. (1-1 1) are the plane wave solutions, 
which have the form (Problem 1-2) 


where 


k = | k | = - = (^e) 1/2 - (1-14) 

V c 

By studying the properties of the solution Eq. (1-1 3), one sees that the wave propagates 
in the direction of the wave vector k (Problem 1-3). Although £ does satisfy the wave 
equation, since it represents components of E and B, it must also satisfy Maxwell’s 
equations as well. 

Let us assume that E and B are plane waves of the form 
E(r, t) = £ 1 E 0 e ,K ' r_ic,>f 

B(r, t) — e 2 B 0 e’’‘ r ~ io>t (M5) 

where z l and e 2 are constant unit vectors, and E 0 and B 0 are (possibly complex) 
amplitudes. The Maxwell equations div E = 0 and div B = 0 require that (Problem 

1 - 4 ) 


Ej k = 0 and e 2 • k = 0 (1-1 6) 

which mean that E and B are both perpendicular to the direction of propagation k. 
We call such a wave transverse. 

In addition, if we substitute Eqs. (1-15) into the Maxwell equation for curl E, 
we find that (Problem 1-5) 

(k x Ej)^ = e 2 B 0 (1—1 7) 

c 

We write this as 

c _ 0* x £ i) 

2 coB 0 (1-1 8) 

cE 0 

Since e t and e 2 are unit vectors and £j , e 2 , and k are mutually orthogonal (Problem 
1-6), we must have 

k = | k [ =— ^ d-19) 

CE 0 

and since k = co/v = (jis) ll2 a>lc, it follows that 
B 0 = (jie) l,2 E 0 
The Poynting vector is 


( 1 - 20 ) 
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and so for the case of a plane wave 

S = (Ei x e 2 ) c ^°^° cos 2 (k • r — cot) 

Now it is easy to show (Problem 1-7) that 

k 

£, X £ 9 =- 

1 2 k 

and so if we average S over one cycle, one finds that (Problem 1-8) 

= cE 0 B 0 k 
av Snfi k 

= ±(!\ m E 2 - 
8k W 0 k 


( 1 - 21 ) 


( 1 - 22 ) 


Since the index of refraction n is defined as the ratio of the speed of light in a 
vacuum to the speed of light in the material medium, then Eq. (1-14) shows that 


n = (jte) 1/2 

Thus Eq. (1-22) can be written as 


(1-23) 


cn _ „ veE 0 ‘ 


S ” 8 8 n 


(1-24) 


which equals Eq. (21-16) for the nonmagnetic medium (// = 1) discussed there. 
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PROBLEMS 

1-1. Derive the wave equation from Maxwell’s equations. 

1-2. Show that Eq. (1-13) is a solution to the wave equation. 

1-3. Show that the function 

£ _ e lU T-lat 


is a plane wave propagating in the direction of k. 

1-4. Derive Eqs. (1-16), showing that an electromagnetic wave is transverse. 
1-5. Derive Eq. (1-17). 

1-6. Prove that the vectors ci , c 2 , and k are mutually orthogonal. 

1-7. Prove that x e 2 = k /k. 

1-8. Derive Eq. (1-22). 



APPENDIX J 


THE RADIATION EMITTED BY AN 
OSCILLATING DIPOLE 


In this appendix we shall discuss the electromagnetic radiation emitted by an oscil- 
lating electric dipole, and, in particular, we shall derive Eqs. (21-46) and (21-47). 
We start with Maxwell’s equations 


div D — 4np 


. „ 4n 1 dD 

curl H = — J H — 

c c dt 


(J-1) 


div B = 0 


, „ 1 <3B 

curl Eh — = 0 

c dt 


with the constitutive equations D = eE, J = cE, and B = /xH. Since div B = 0 and 
div curl V = 0 for any vector V, we define a vector A, called the vector potential, by 

B = curl A (J— 2) 


With this form for B, the Maxwell equation for curl E reads 



(J-3) 


Since curl grad £ = 0 for any function £, the terms in parentheses here can be written 
as the gradient of some scalar function, e.g., 4>, to get 


„ 1 8 A 

Eh — = -grad <j> 

C Cl 

In terms of A and <f > , the two inhomogeneous Maxwell equations become 


(J-4) 


curl curl A + 


1 a 2 A 1 


d<l> 4n 


c 2 dt J 


+ - grad — = — J 


dt 


1 d 


div grad </> + - — div A — —4np 
c dt 


(J-5) 

(J-6) 
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or (Problem J-l) 


V 2 A — 


1 8 2 A 
c 2 dt 2 


1 3 

V 2 (f> + --(y-A)=-4np 
c dt 

— Wv • A + -~r-\ = — — J 
\ c dt) c 


(J — 7) 


(J-8) 


Since B = curl A, the vector potential is arbitrary to within an added gradient of 
some scalar function C* Thus B is unchanged by 


A-*A + Vf 

Similarly E is unchanged by (Problem J-2) 
v v cdt 


(J-9) 


(J-10) 


The two transformations (J-9) and (J-10) allow us to choose A and <f> such that 


a id 4> n 
A + --f=0 
c dt 


(J-11) 


Notice that this uncouples Eqs. (J— 7) and (J-8) to 


( J— 1 2) 

( J — 1 3) 


Equations (J-11) through (J-l 3) are equivalent to Maxwell’s equations. 

We remark in passing that transformations (J-9) and (J-10) are called gauge trans- 
formations, and the invariance of the fields under such a transformation is called 
gauge invariance. The particular relation given by Eq. (J-11) is called the Lorentz 
gauge. Although this gauge is not unique, it is the most commonly used since it leads 
to the wave equations, Eqs. (J-l 2) and (J-l 3). See the work by Jackson (“ References ”) 
for a detailed discussion of this problem. 

Both Eqs. (J-l 2) and (J-l 3) are inhomogeneous wave equations of the form 

1 3 2% ¥ 

V 2, P - s 'ei 2 = -4 ^ (r ’ (J_1 4) 


where /( r, t) is a known source distribution. Such an equation can be solved by the 
method of Green’s functions (cf. Jackson, Section 6.6) to give 


f ( r, I) - J — -/<r'. O dt' dt' (J -1 6) 

The quantity 

G(r, t ; r', t’) = — (J-1 6) 

is called a (retarded) Green’s function. 
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Of direct interest to this appendix is the special case of Eq. (J— 1 5) 

i r Jfr'.O J., . k-'l \ 




Assume now that J(r, t) = J(r)e i0> '. Equation (J-17) becomes then 
A(r, t) =A(r)e~ t ‘ 0, 

with 

1 r e lk \*-*'\ 


(J-17) 


(J-1 8) 


(J— 1 9) 


where as usual k = co/c. 

If the region in which J is nonzero is small and r is far from this small region, then 
|r — r'| a r, and Eq. (J— 19) becomes 


e , r c 

A(r) = — J J(r') dr' 


(J-20) 


More precisely, one expands the plane wave e , *l r-r 'l/| r — r'| in a series of Bessel 
functions and retains only the first term (cf. Jackson). Equation (J-20) can be inte- 
grated by parts to give 


Jj dr' = - Jr'[V • J(r')] dr' 


but from the equation for continuity of charge, namely, 

^ + div J = 0 
dt 

we see that if p(r, t) = p(j)e~ lC} \ then 
icop = div J 
and so 

e ikr 

A(r) = — iky.— 

where p is the electric dipole moment 


( J— 21 ) 


(J-22) 


H=JVp(r')rfr' 


(J— 23) 


(J-24) 


Equation (J— 23) is a central equation of this appendix, since once A(r, t) = A(r)e io>t 
is known, E and B are known through Eqs. (J-2) and (J-1), namely (Problem J-3), 


B = curl A 


(J-2 5) 


E = ^ curl B (J— 26) 

k 

Equation (J-26) is valid only away from the source. Straightforward application 
of these two equations to Eq. (J-23) yields (Problem J-4) 

e ikr 

B = k 2 (e. x p) — (J-27) 

r 

and 

E = B x z r 

where c r is a unit vector in the direction of r. 


( J — 28) 



APPENDIX J THE RADIATION EMITTED BY AN OSCILLATING DIPOLE 625 


In order to derive Eq. (21-47), the rate of radiation emitted per unit solid angle in 
the direction of X , we let c r = z x and p = (ZxVox + SyVoy + E zlk>z) cos Equa- 
tions (J-27) and (J-28) become (Problem J-5) 


B 


Akr 


— k 2 (z 2 Hoy — t*oz) c ^s cot — 


( J — 29) 


e ikr 

E = k 2 (e Y HoY + Cz/^oz) cos cot — 

r 


(J— 30) 


Note that E and B are perpendicular to each other as they should be (cf. Appendix I). 
The flux of energy is given by the Poynting vector S (cf. Appendix I) 


S=^(ExH) ( J — 31 ) 

4n 

or 

S=j-(Ex H*) (J— 32) 

if E and H are given in complex notation. 

Using Eqs. (J-29) and (J-30), we see that 

ck 4 

S = e x — (nor 2 + n oz 2 ) cos 2 cot (J— 33) 

471 

which is Eq. (21-47) if we average over one vibrational cycle. 
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PROBLEMS 

J-l. Show that Eqs. (J-5) and (J-6) are equivalent to Eqs. (J— 7) and (J-8). 

J-2. Show that E is unchanged by the transformation (1/c) (d£/d/) where f 

is a scalar function. 

J-3. Derive Eq. (J-26). 

J-4. Derive and discuss geometrically Eqs. (J-27) and (J-28). Remember that an electro- 
magnetic wave is transverse. 

J-5. Derive Eqs. (J-29) and (J-30). 



APPENDIX K 


DIELECTRIC CONSTANT AND 
ABSORPTION 


In this appendix, we introduce the idea of representing the frequency-dependent 
dielectric constant as a complex number, and then interpret the real and imaginary 
parts. First recall the static equations 

D = Ana D = E + 4nP 


D = e s E P=xE (K-1 ) 

D = (1 + 4t tx)E (K— 2) 

where e s is the static (zero-frequency) dielectric constant and x is the susceptibility. 
These equations become inapplicable for high-frequency fields, or if we wish to keep 
these equations, we must generalize and reinterpret e and y. This is what we shall do. 

We could look at how P(t) behaves when E(t) varies, but it is more convenient to 
look at EXj) instead of P{t). Since D = (1 + 4nx)E, this amounts to essentially the 
same thing. Consider the case in which E(t ) = E 0 cos cot. If this field has persisted for 
a time, D(t) must also vary with the same frequency, but must not necessarily be in 
phase with E(t) since, in general, there is a time lag between P(t) and E(t). This can 
be expressed by writing 

D(t) = D 0 cos (cot — <j>) = D x cos cot + D 2 sin cot (K-3) 

with 


Z)| — Z)q cos (j) jDj — Dq sin (j) (K — 4) 

The phase angle is certainly a function of co and should go to zero as w goes to zero, 
which gives D{t) = D x cos cot = eE 0 cos cot as co 0. In general, however, we can 
define two “dielectric constants” b\<o) and e"(co) by 

D x = e'(o)E 0 and D 2 = e{(o)E 0 (K-5) 

The dielectric constant we studied before then was £'(0)- Again we say that £"(co) 0 
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as co-+0. We shall show shortly that e"(o) is directly related to the absorption 
coefficient of the system. Since this represents energy lost from the incident electric 
field, this is often called dielectric loss. 

It is convenient to define a complex dielectric constant by 

i(co) = e(o) - ie"(o)) (K— 6) 

since if we write E = E 0 e lo> \ we then have 


D = tE 


(K— 7) 


even for time-dependent fields. We, of course, take only the real part of this equation. 
Realize that writing I is a convenient formal device that allows us to write D = eE. 
It does not mean that e becomes complex at high frequencies. We shall see how Max- 
well’s equations can similarly be reinterpreted to accommodate a complex formalism 
for e, but before doing this let us look at the relation between E{t) and D{t) once again. 

We can represent the time lag between D(t ) and E(t) 9 or P{t) and E(t)> by means 
of an after-effect or response function : 

D(t ) = E(t) + f E(t')a(t - O df (K— 8) 

J o 

where the part of D that can follow E instantaneously is written as a separate term. 
Introducing E{t) — E 0 cos ot, we get 

D(i) — E 0 cos ot = Eq f a{t — t') cos cot ' dt' 

J o 

= E 0 f a(x) cos co{t — x ) dx (K-9) 

J o 

We assume now that E has persisted long enough to make D(t) a periodic function 
with frequency co. This means that t is larger than the time t 0 it takes for a (t) to vanish. 
In view of this then, we can set the upper limit of the above integral equal to oo, and so 
Eq. (K-9) becomes 

f 00 

D(t) — E 0 cos cot = E 0 a(x) cos co(t — x) dx 

J o 

or using Eqs. (K-3) and (K-5), 


(e'(co) — 1) cos ot + e"(o) sin ot 

r°° r°° 

= cos ot a(x) cos ox dx + sin ot a(x ) sin ox dx 

J ri 


from which we see 


r 00 

e'(o) — 1 = a(/) cos ot dt (K— 1 0) 

J o 

e"(o) = f a(/) sin ot dt (K-l 1 ) 

J o 

These equations are similar to the equations that are derived in Section 21-5. They 
are derived here without giving a(t) any molecular significance, whereas in Section 
21-5, a(t) is a dipole moment correlation function. It is from a pair of equations just 
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like this that the Kramers-Kronig relations between e'(co) and e"(o) are derived [cf. 
Eqs. (21-182 and 21-183)]. 

Now we wish to show how the introduction of i affects some other electromagnetic 
results, particularly Maxwell’s equations. One result we wish to preserve is the low- 
frequency relation between the dielectric constant e and the index of refraction n. 
This relation is e = n 2 . We shall accordingly define a complex index of refraction by 


i=h 2 

(K-1 2) 

As in all the generalizations we shall make, we assume that the real part of our com- 
plex quantities is just the ordinary property and that the pure imaginary part vanishes 
as co -> 0. This is simply to assure consistency with all our ordinary low-frequency 

equations. 


Consider now Maxwell’s equations 


f „ 1 dB 4nJ 

curl H — = 

c ot c 

(K-1 3) 

i ^ ldB * 

curl EH — =0 

c ot 

(K-1 4) 

div D = 4np 

(K-1 5) 

div B = 0 

(K-1 6) 


along with the material equations J = <rE, D = eE, B = jiH. We shall confine our- 
selves to nonmagnetic materials (fi — 1) and no currents or free charges (J = 0; p = 0). 
Now eliminate B = /zH from the first two Maxwell equations by taking djdt of the 
first and taking curl of the second (curl curl =grad div —V 2 ). One eventually gets 


V 2 E 


e/i 3 2 E 


(K— 1 7) 


i.e., a wave equation for E. It was from this equation incidentally that Maxwell pre- 
dicted that electromagnetic radiation was wavelike and propagated with a velocity 
c/(/xe) 1/2 (s and /i = 1 in a vacuum). 

Now substitute E x = E 0 e i{c}t ~ kx) into the wave equation to get 


V 2 E X + 


EfiCO' 


E x = 0 


(K— 1 8) 


This tells us that the wave vector k is 

2 Epco 2 eco 2 . 

k = — 2 — = — j- (nonmagnetic) 

c c 

Now in order to preserve the form of this equation, we define 



c 


(K-1 9) 


(K— 20) 


(K— 21 ) 
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Set 


a =/i-/K=(i) I/2 

(K — 22) 

and so 


e’(<d) =rt 2 — k 2 

( K— 23) 

e"(<u) = 2 Kit 

(K— 24) 

Our equation for E x is 


E x = E 0 e^e-** = E 0 

(K — 25) 

or the intensity is \E X \ 2 = E 0 2 e~ 2mKI,c . Comparing this to Lambert’s law, I 
we finally see that 

=v~ ax . 

, x 2 (ok c"(co)co coe'Yco) 

a(co) = = « (gases) 

c nc c 

(K— 26) 


The quantities without the ~ denote the real part, and we have used the fact that 
n ^ 1 for gases. Thus we see that the absorption coefficient a(co) and the imaginary 
part of the complex dielectric constant are directly related. 
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Absolute activity, 55 
Absorbing barrier, 399 
Absorption coefficient, 471, 496, 629 
Absorption lineshape, 472, 501 
Accessible quantum states, 36, 37, 45, 46 
Acoustic attenuation coefficient, 571 
Acoustical branch, 210 
Activity, 297 
Activity coefficient, 338 
Adiabatic compressibility, 29, 390 
Adjoint operator, 616 
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After-effect function, 490, 539, 627 
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Angular momentum 
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Antisymmetric wave function, 13 
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Asymmetric top, 134 
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Autocorrelation function, 469 
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Born approximation, 544 
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Boyle temperature, 247 
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Canonical ensemble, 37 
of thermodynamic functions, 44, 45, 58 
Canonical ensemble average, 39, 40, 182 
Canonical ensemble partition function, 44 
Capillary flow, 394 
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Center of mass, 26, 133, 140 
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Chapman-Enskog method, 419, 422, 426, 
446 

and time correlation function, 525-527 
Chapman-Kolmogorov equation, 457 
Characteristic function, 31, 55 
Characteristic rotational temperature, 99, 132 
Characteristic thermodynamic potential, 66 
Characteristic vibrational temperature, 97, 
132 


Charging parameter, 330, 336 
Chemical equilibria, 18, 142 
Chemical kinetics from light scattering, 568 
Chemical potential, 17, 143, 151 
Classical configuration integral, 116 
Classical limit, 72, 75, 76, 113, 185-189 
equilibrium, 72, 75, 76, 113, 185-189 
nonequilibrium, 494, 512, 535 
Classical mechanics, 3 
Classical partition function, 113 
Classical statistical mechanics, 113 
Classical statistics, 72 
Clausius-Mossotti equation, 63 
Closest-packed hard-sphere volume, 279 
Closure relation, 473, 614 
Cluster diagrams, 230, 231, 232, 246 
connected, 231 
doubly connected, 231, 232 
integrals of, 230 
stars in, 231, 232 

Coefficient of bulk viscosity, 385, 386, 521, 
522, 523, 539 
reduced, 531, 539 

Coefficient of self-diffusion, 431, 448 
reduced, 435, 448, 531, 532, 539 
and time-correlation function, 515 
Coefficient of shear viscosity, 361, 385, 386, 
431, 447 

frequency-dependent, 583 
reduced, 434, 448, 531, 532, 539 
and time-correlation function, 519 
Coherent neutron scattering, 547 
Collisional invariants. See Summational 
invariants 

Collision cross section, 372 
Collision cylinder, 407 
Collision frequency, 370, 373, 375, 524, 525 
Collision-induced absorption, 375, 503, 505, 
506, 541, 542 

Collision-induced dipole moment, 542 
Collision-induced light scattering, 482, 484 
Collisional integrals, 428, 430 
for hard spheres, 430 
and recursion formula, 448 
reduced, 431, 447 
table of, 434 

and thermal transport coefficients, 431 
Collision operator, 409 
Collision term, 410, 419, 422 
Collision time, 524, 525 
Collision trajectory, 376 
Color center, 216, 220 
Combinatorial formulas, 22, 23 
Commutator, 494, 615 
repeated, 502 

Complementary solution, 394 
Complete set, 613, 597, 598 
Completeness relation, 536, 537, 614 
Compressibility factor, 246, 248, 283 
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Configuration integral, 116, 124, 222, 227 
Conjugate momentum, 7 
Conservation equations of hydrodynamics, 
380 

Conservation of angular momentum, 27 
Conservation of density in phase, 120 
Conservation of energy, 8, 27 
Conservation of extension in phase space, 
120, 422 

Conservation of momentum, 26 
Constitutive relations, 618, 622, 628 
Continuity equation, 380 
Convolution approximation, 586 
Convolution theorem, 269, 270, 573, 586 
Correlation function, 453, 468 
and classical limit, 512 
and dipole, 473 
in canonical ensemble, 268 
in grand canonical ensemble, 268 
one-sided, 501 
and polarizability, 486 
statistical, 555, 556 
Correlation time, 453, 468-469 
Corresponding states, law of, 200, 204, 205, 
243-245, 248, 434, 435, 448, 531, 532, 
539 

Couette flow, 393, 400 
Couette-Poiseuille flow, 400, 401 
Coupled harmonic oscillators, 196, 207 
Coupling parameter, 263, 264, 290 
Critical constants, 244, 257, 288 
Critical parameters, reduced, 288 
Barker-Henderson theory, 309, 315 
square-well fluid, 309 
from various theories, 288 
Cross section, 372 
Crystals 

Debye model of, 200-206 
defects of, 214-217, 219, 220 
Einstein model of, 197-200 
vapor pressure and, 220 
Cumulants, 548 
Cyclic graphs, 343, 353 
Cyclic permutation, 498, 511, 535 


de Broglie wavelength, 83, 90, 163, 207 
Debye 

and dielectric relaxation, 497 
and frequency, 202 

and heat capacity function, 203, 219, 599 
and rotational relaxation, 398 
and temperature, 32, 202, 203, 206, 221 
and theory of heat capacity of ciystals, 

32, 200-206, 218, 219 
Debye-Hiickel kappa, 332, 352 
Debye-Hiickel radial distribution function, 
353 

Debye-Hiickel screening parameter, 332, 352 


Debye-Hiickel theory, 328 
and ionic atmosphere, 335, 352 
and limiting law, 336 
from Mayer theory, 343 
from Omstein-Zemicke equation, 344-345 
and potential of mean force, 331, 334 
and screening parameter, 332 
and thermodynamics, 336-339 
Deflection angle, 368, 375, 433 
for hard spheres, 370, 375, 446 
Degeneracy 

harmonic oscillator, 10, 94 
W-particle system, 11, 12, 28, 36 
particle-in-a-box, 10, 11, 28 
rigid-rotor (three-dimensional), 10, 94 
rigid-rotor (two-dimensional), 112 
spherical top, 134 
symmetric top, 135 
Degrees of freedom, 114 
rotational, 130 
separation of, 116, 130 
translational, 130 
vibrational, 130 

Delta function, 20, 30, 187, 472, 504, 535, 
595-598 

integral representation of, 473 
Density matrix, 185, 537 
Density of states 

in phase space, 118, 403 
photon gas, 179 
three-dimensional ideal gas, 82 
two-dimensional ideal gas, 28, 89 
Depolarization ratio, 482, 484 
Deviation plots, 438, 439 
DHLL + B 2 , 347 
Dielectric constant, 50 
and absorption, Appendix K, 629 
complex, Appendix K 
frequency-dependent. Appendix K 
Dielectric loss, 496, Appendix K 
Dielectric relaxation, 495 
Dieterici equation, 246 
Difference equation, 207 
Differential cross section, 422 
neutron scattering, 544 
Raman scattering, 484 
Diffuse double layer, 354 
Diffusion coefficient, 363, 364 
frequency-dependent, 582, 591 
generalized, 591 
and mixtures, 449 
Diffusion constant, 216, 220 
Diffusion equation, 386, 397, 513, 564 
with anisotropic, 398 
with chemical reaction, 568 
with drift, 398, 564 
Fourier transform and, 386, 388, 513 
fundamental solution for, 387, 388, 

397 
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Diffusion equation ( Continued ) 
method of images in, 399 
in phase space, 459 
rotational, 398 
in velocity space, 458 
Diffusion in solids, 215, 216 
Diffusion tensor, 580 
Diffusion velocity, 407 
Dipole moment, 49 
Dipole moment operator, 471 
Dirac 8-function, 20, 30, 187, 472, 504, 535, 
595-598 

Dirac notation, Appendix G 
Direct correlation function, 268, 297 
density expansion, 274 
hypernetted chain equation, 276 
mean spherical model, 355 
Percus-Yevick equation, 276, 297 
Direction cosines, 479, 480, 533 
Dispersion curve, 208, 209, 219 
Displacement vector, 49 
Dissociation energy, 96, 110, 132 
Distinguishable particles, 69, 70, 71 
Distribution function 
in canonical ensemble, 257 
in grand canonical ensemble, 268 
Doob’s theorem, 560, 561 
Doppler broadening, 127 
Doppler effect, 127 
Double dot notation, 384 
Dual space, 613 

Dulong and Petit, law of, 49, 199, 204, 218 
Dyadic, 383 


Ehrenfest dog-flea experiment, 423 
Eigenvalues 

harmonic oscillator, 10, 94 
particle-in-a-box, 10, 27, 28 
rigid-rotor, 10, 94 
Einstein 

characteristic temperature of, 32, 49, 199 
heat capacity theory of, 3 1 , 49, 1 97-200, 218 
Einstein relation, 577 
Elastic collision, 412 
Elastic constants, 206 
Elastic moduli, 584 
Electrode potential, 354 
Electron gas in metals, 164-168 
Electronic heat capacity of metals, 168, 221 
Electronic partition function, 83 
Electrophoretic mobility, 564 
Energy balance equation, 384 
Energy conversion factors, 594 
Energy density (electromagnetic radiation), 
679 

Energy equation, 261, 419 
mixtures, 293 


Ensemble 
canonical, 37, 403 
grand canonical, 51 
isothermal-isobaric, 55 
microcanonical, 37, 56 
Ensemble average 
classical, 118 
quantum, 35, 182 

Ensemble average reversible work, 43, 54 
Ensemble postulate of Gibbs, 36, 41, 118 
Enskog equation of change, 412 
Enskog theory, 440 
modified, 443 

and time-correlation functions, 530 
Entropy, 13, 78 

canonical ensemble and, 43, 48, 58 
grand canonical ensemble and, 54, 58 
of ideal diatomic gas, 109 
of ideal monatomic gas, 86 
of ideal polyatomic gas, 138 
isothermal-isobaric ensemble and, 58 
microcanonical ensemble and, 55, 58 
Entropy and disorder, 56, 138 
Equation of state, 246, 247 
Barker-Henderson, 310, 321 
Chandler-Weeks- Andersen, 316 
ideal gas and, 49 

ideal quantum gas and, 163, 170, 174, 175, 
176 

van der Waals, 304, 321 
van der Waals, generalized, 321 
Equation of state, hard spheres, 257, 292 
Pad6 approximant, 255-257 
Ree-Hoover, 255-257 

Equilibrium constant, 19, 29, 143, 144, 151, 
155 

Equipartition of energy, 121, 127, 128, 454 

Equivalence of ensembles, 63 

Ergodic hypothesis, 554, 555 

Error function, 127 

Escape velocity, 373 

Eucken correction, 374, 449 

Euler angles, 535 

Euler equation, 389 

Euler-MacLaurin summation formula, 99, 

110 , 111 

Euler’s theorem, 17, 18, 65 
Exponential approximation, 351 
Exponential-6 potential, 436 
Extensive property, 17, 243 
Extent of reaction, 18 

Faraday’s law, 618 
F-center, 216, 220 

Fermi-Dirac distribution function, 164 
Fermi-Dirac statistics, 73, 74, 160 
Fermi energy of a metal, 165, 190 
Fermi golden rule. Appendix F 
Fermi pseudopotential, 546 
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Fermi temperature, 166, 190, 221 
Fermions, 13, 69, 101, 160 
/■function, 228, 342 
Fick’s Law, 216, 386 
generalized, 582, 591 
Field point, 272 
Flow velocity, 407 
Fluctuating force, 452 
Fluctuation-dissipation theorem, 463, 540, 
541 

Fluctuations, 57, 79 
of canonical ensemble, 59, 60 
of grand canonical ensemble, 61, 62 
of isothermal-isobaric ensemble, 62, 67 
Flux vector, 408 
Fokker-Planck equation, 456 
multidimensional, 464 
solution for, 464 
Force constant, 27, 93, lio, 196 
Fourier series, 557, 587 
Fourier’s law, 385 

Fourier transform, 186, 312, 313, 397, 
595-598 

and convolution theorem, 269, 270, 586 
and diffusion equation, 386, 388, 397, 

513, 514 

and Navier-Stokes, equation, 517 
Free electron model of metals, 164-168 
Frenkel defect, 214, 220 
Frequency-dependent coefficient of shear 
viscosity, 583 

Frequency-dependent conductivity, 490, 539 
Frequency-dependent dielectric constant, 
496, Appendix K 

Frequency-dependent diffusion coefficient, 
582, 591 

Frequency-dependent susceptibility, 490, 495 
Frequency spectrum of crystals, 194-197, 
212 

Friction constant, 452, 463 
Fugacity, 248, 290 
Functional, 275 
Functional differentiation, 275 
Fundamental solution, 387 
Fused salts, 324 


Gamma function, 12, 32, 247 

#-bond, 342 

Gauge invariance, 623 

Gauge transformation, 623 

Gauss’ divergence theorem, 380, 403 

Gauss’ law, 329, 351 

Gaussian distribution, 20, 24, 30, 454, 460 
Gaussian function, 20, 24 
Gaussian gas, 252 
Gaussian integrals, 252 
Gaussian-Markov process, 559, 589, 590 


Gaussian random process, 559, 587, 589, 
590 

Generalized coordinates, 114 
Generalized hydrodynamics, 584 
Generalized momentum, 7 
Generalized velocities, 8 
Geometric series, 32, 97 
Gibbs-Duhem equation, 18 
Gibbs-Helmholtz equation, 263, 290, 353, 
356 

Gibbs postulate, 36, 41, 118 
Golden Rule of perturbation theory, 471, 
Appendix F 

Good quantum number, 87 
Gouy-Chapman theory, 354 
Grad’s solution of the Boltzmann equation, 
429, 430 

Gram-Charlier series, 542, 585, 586 
Grand canonical ensemble, 51 
Grand partition function, 54, 58, 65 
Graphs, 230-232 
connected, 231 
doubly connected, 231, 232 
Green’s function, 386, 397 
retarded, 623 
Group velocity, 210, 219 
Grundke-Henderson program, 289, 322, 
354, Appendix D 
Gruneisen constant, 218 


Hamiltonian function, 7 
Hamiltonian mechanics, 7, 404 
Hamiltonian operator, 9 
additive, 12, 69, 131 
harmonic-oscillator, 10 
rigid-rotor, 10, 27, 28 
Hard disks, 443 

Hard-sphere potential, 307, 312, 313, 314 
Hard spheres, fluid of 
and equation of state, 279, 280, 256-275 
and mixtures, 297 

and molecular dynamics calculations, 
256, 280 

and radial distribution function, 282, 
Appendix D 

and thermodynamic properties, 283, 292, 
293 

and virial coefficients, 234, 239, 241, 279 
Harmonic oscillator 
classical, 4, 26, 27 
quantum, 10 
Heat capacity 

Debye theory and 200-206, 218 
Einstein theory and, 197-200, 218 
low temperature limit and, 204, 205, 218 
of ortho-para hydrogen, 106, 107 
Heat of reaction, 149, 158 
Heat of transition, 176 
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Heisenberg equation of motion, 492, 502, 
617 

Heisenberg representation, 472, Appendix H 
Heisenberg uncertainty principle, 9 
Henry’s law constant, 324 
Hermite polynomial, 542, 585 
Hermitian operator, 508, 537, 575, 588, 615, 
616, 617 

Heterodyne method, 562, 567 
Hierarchy of coupled equations, 266, 290, 
406, 418, 419 

High temperature limit, 75, 118, 161 
Hindered rotation, 138, 139, 140, 150 
Homodyne, 562, 566, 567 
Homogeneous function, 18 
Hooke’s law, 4 
//-theorem, 413 
Hydrodynamic derivative, 381 
Hydrodynamic stage, 426 
Hydrogen atom catastrophe, 90 
Hypemetted-chain equation, 276 
and density expansion, 291 
and ionic system, 346 
Hypershell, 125 
Hypersphere, 12, 28, 124 

Ideal gas 
diatomic, 91-112 
in an electric field, 49 
monatomic, 80-90 
and photons, 177-182, 193, 212 
polyatomic, 129-141 
Ideal gas of photons, 177-182, 193, 212 
Ideal hydrodynamic equations, 389, 419, 

427 

Image source, 399 
Impact parameter, 367 
Imperfect gas, 222-253 
polyatomic, 249, 250 
Incoherent neutron scattering, 547 
Incompressible fluid, 396, 397 
Index of refraction, 628, 629 
complex, 628, 629 
frequency-dependent, 628, 629 
Information theory, 48, 589 
Inhomogeneous integral equation, 427 
Inhomogeneous partial differential equation, 
394 

Inhomogeneous wave equation, 623 
Inner product, 612 
Integra] equation, linear 
solution of, 450, 451 
Integrating factor, 14, 44, 54 
Integrating factor, PDE, 464 
Intensive property, 17, 243 
Interionic radial distribution function, 353 
moment condition, 355 
Intermediate statistics, 78 
Intermolecular potential, 227, 228 


Internal pressure, 443 
Internal rotation, 138, 139, 140, 150 
Intemuclear potential, 91, 92 
Inversion temperature, 249 
Ionic activity coefficient, 338 
Ionic atmosphere, 335, 336 
Ionization, 158, 159 
Irrotational flow, 391 
Isothermal compressibility, 62, 67 
Isothermal-isobaric ensemble, 55 
Isothermal-isobaric partition function, 55, 
65, 90 

Isotope effects on equilibrium constants, 
145-148, 158 

Isotope exchange reaction, 145, 147, 158 

Jacobian, 371, 422 
j-j coupling, 88 

Joule-Thomson coefficient, 249 

Kac ring model, 424, 425 
&-bond, 342 
Kernel, 574 
Ket vector, 613 
Kihara correction, 449, 450 
Kihara potential, 436 
Kinetic theory of gases, 125 
and collision probability, 359 
and diffusion coefficient, 363, 364 
and effective hard-sphere diameter, 365 
and mean free path, 358, 359 
and thermal conductivity, 362 
and viscosity, 360, 361 
Kirchoff ’s law, 376 
Kirkwood equation, 253, 264 
Kirkwood hierarchy, 290 
Kramers-Kronig relations, 498, 499 
Kroenecker delta, 184, 382 
Kubo transform, 493, 494, 495, 511, 535, 536 
and classical limit, 535 

Lagrange undetermined multipliers, 23, 40, 
48, 53 

Lagrange’s equations, 6, 27 
Lagrangian function, 6 
Lagrangian mechanics, 6 
Lambda transition, 177 
Lambert’s Law, 496, 629 
Landau-Placzek ratio, 571, 588 
Langevin equation, 452 
Langmuir adsorption isotherm, 79, 221 
Laplace’s equation 
Larmor power formula, 376 
Laser light scattering, 561 
chemical kinetics, 568 
linear hydrodynamics, 568 
Lattice dynamics, 206-212 
Lattice vacancy, 214-217 
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Law of corresponding states 
equilibrium, 200, 204, 205, 243-245, 248 
thermal transport coefficients, 531, 532, 
539 

Laws of thermodynamics, 13 
Legendre polynomial, 398, 487, 488 
Legendre transformation, 15, 30 
Lennard- Jones 6-12 parameters 
from second virial coefficient data, 434 
from viscosity data, 434 
Lennard-Jones, potential, 234 
Light scattering, 62, 298, 299, 476 
collision-induced, 482-484 
laser, 561 

Line shape, 375, 472, 501 
Linearized Navier-Stokes equation, 570 
and Fourier transforms, 517 
Linear velocity, 406, 407 
Liouville equation, 120, 125, 404, 507 
quantum mechanical, 510, 537 
Liouville operator, 404, 508, 537 
and Poisson brackets, 508 
Liouville’s theorem, 412 
Local compressibility approximation, 308 
Local equilibrium, 361, 413 
Local Maxwell distribution, 419, 427 
Longitudinal waves, 201 
Lorentz gauge, 623 
Lorentzian line shape, 504, 564 
Loschmidt objection, 415 
L-S coupling, 88 


Macroscopic compressibility approximation, 
308 

Magnetic dipoles, 79 
Markov process, 457, 555 
Markovian spectrum, 560 
Mass average velocity, 407 
Mass balance equation, 380, 381 
Master equation, 423 
Matrix elements, 183 
Maximum term method, 25, 31 
Maxwell-Boltzmann distribution, 125, 358, 
415 

Maxwellian molecules, 370 
Maxwell potential, 369 
Maxwell’s equation, 618, 622, 628 
Mayer convergence factor, 342 
Mayer /-function, 228 
Mayer summation function, 342, 353 
Mayer theory, 340 
McMillan- Mayer theory, 327 
Mean free path, 358, 359, 373 
binary mixture, 373 
distribution, 374 

Mean free path kinetic theory, 358 
and time correlation function, 523 


Mean ionic activity coefficient, 338, 339, 
352 

concentrated solutions, 340 
hypemetted-chain equation, 347 
limiting law, 339 
mean spherical model, 356 
mode expansion, 349 
optimized random phase approximation, 
347, 348 

Mean spherical model, 348, 355 
for dipoles, 356 

Mean-square displacement, 397, 455, 462, 
514 

Mechanical property, 35 
Memory function, 572 
Metals, 164-168 
Method of images, 399, 400 
MEX, 348 

Microcanonical ensemble, 37, 56 
Microscopic reversibility, 421 
Mie-Gruneisen equation, 218 
Mixtures, fluid, 293, 294, 295, 296, 297 
effective Lennard-Jones parameters, 294 
energy equation, 293 
Omstein-Zemicke equation, 293 
Percus-Yevick equation, 293 
pressure equation 
Mixtures, gases, 248 
virial expansion of, 248 
Mode expansion, 348 
Modified cluster integrals, 271 
Molecular chaos assumption, 410, 422 
Molecular collisions, 365-370 
cross section of, 372 
frequency of, 370, 373 
Molecular constants 
for diatomic molecules, 95 
for polyatomic molecules, 132 
Molecular dynamics, 255, 256, 281, 297, 
441, 442, 456 

and Boltzmann //-theorem, 416 
and diffusion constant, 578 
and Enskog theory, 442 
and Gaussian approximation, 590, 591 
and hard-sphere radial distribution 
function, 288, 289, Appendix D 
and hard-sphere structure factor, 301 
and mean-square displacement, 456 
and power spectrum, 552, 579 
and radial distribution function for hard 
spheres, 288, 289, Appendix D 
and radial distribution function for 
Lennard-Jones fluid, 259 
and radial distribution function 
for square well fluids, 308 
and superposition approximation, 280, 
281 

and velocity autocorrelation function, 
516, 578 
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Molecular partition function, 70, 75, 113 
Molecular velocities, 358 
relative, 371 

Moment of a distribution, 20 
Moment of inertia, 94, 110, 133, 140 
Momentum balance equation, 381, 419 
Momentum, conjugate, 7 
Momentum eigenfunction, 186 
Monte Carlo calculations, 288, 289 
ionic systems, 346 
mixtures, 296, 297 
Monte Carlo method, 240 
Morse potential, 92, 1 10 
/n-6-8 potential, 437, 604 
and parameters, 439 
tables for, Appendix E 
Multicomponent mixture, gases as 
virial expansion of, 248 
Multinominal coefficient, 22, 25 
distribution of, 22, 31 
theorem on, 22 

Natural variables, 15, 45, 55 
Navier-Stokes equation, 391, 515 
linearized, 396, 516, 538, 570 
TV-dimensional sphere, 12, 28, 124 
Neutron scattering, 544 
coherent, 547 
incoherent, 547 

Newton’s law of viscous flow, 361 
Newtonian mechanics, 3 
Newtonian pressure tensor, 396, 446 
Node, 274 

Nonmechanical thermodynamic variable, 35 
Nonviscous flow, 389, 390, 391 
Normal coordinate analysis, 131 
Normal coordinates, 131, 132, 194, 195, 196 
Normal random process, 559, 560 
Normal solution, 426, 428 
No-slip boundary condition, 393, 394 
Nuclear partition function, 83, 85 
Nuclear spin functions, 102 

Occupation number, 38, 212 
One-fluid theory, 294 
One-sided correlation function, 501 
Opalescence, critical, 62 
Optical branch, 210 

Optimized random phase approximation, 348 
Ornstein-Zemike Equation, 269, 270, 275, 
290, 344 

for mixtures, 293 
ORPA -h B 2 approximation, 350 
Ortho- and para-molecules, 105, 111 
Ortho-para deuterium, 111 
Ortho-para hydrogen, 105, 106, 107 
heat capacity of, 106, 107 
Oscillating dipole, 478, Appendix J 


Osmotic coefficient, 338 
DHLL + .B 2 , 347 
hypernetted chain, 346 
mean spherical model, 350 
MEX, 349 

Monte Carlo calculations, 346, 356 
Percus-Yevick approximation, 346 
ORPA -f- B 2 , 350 
various theories, 356 

Pad6 approximant, 256, 257, 283, 289 
Pairwise additivity, 124, 228 
Parallel band 

Parseval’s theorem, 556, 587 
Partition function 
classical, 113-117 
electronic, 83, 84, 109, 130 
independent, distinguishable particles and, 
70 

independent, indistinguishable particles 
and, 71 

nuclear, 83, 85, 130 
rotational, 98, 104 
translational, 81, 82, 93 
vibrational, 96, 197 

Partition function and thermodynamic 
functions, summary of, 58 
Peculiar velocity, 407 
Percus-Yevick-Allnatt equation, 346 
comparison to Monte Carlo data, 346 
Percus-Yevick equation, 276 
comparison to experiment, 284, 288 
density expansion, 277, 278 
hard-sphere equation of state, 279, 280 
mixtures, 293 

Periodic boundary conditions, 209 
Perpendicular band, 474 
Phase point, 117, 402 
Phase space, 117, 402, 403, 417 
Phase velocity, 208 
Phonon, 207, 212, 213 
Phonon gas, 212, 213 
Photon, 177 

Physical constants, table of, 593 
Planck radiation law, 31, 182, 192, 193, 

376 

Planck’s constant, 593 
Plane wave, 620, 621 
Plane wave expansion, 624 
Poincare cycle, 416, 424 
Poincare recurrence theorem, 416, 424 
and irreversibility, 424 
Point defects in solids, 214-217 
Poise, 365 

Poiseuille flow, 392, 393, 394 
Poisson-Boltzmann equation, 
linear, 332 
non-linear, 331, 354 
Poisson bracket, 404, 494 
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Poisson distribution, 20, 30, 90 
Poisson equation, 328, 330, 351 
Polarizability, 476, 477, 484, 533 
anisotropic part, 480, 533 
longitudinal, 484 
A-body expansion, 251 
spherical part, 480 
table of, 481 
tensor, 477 
transverse, 484 
Polarization vector, 49, 179 
Postulate of Gibbs, 36, 41, 118 
Potential barriers of rotation, 138, 139, 140 
table of, 140 

Potential of mean force, 266, 267, 290 
Debye-Huckel, 331 
Power spectrum, 557, 587 
Poynting vector, 471, Appendix I 
Prefactor, 613 

Pressure broadening, 503, 504 
Pressure due to radiation, 181 
Pressure equation, 261-262, 289 
for hard spheres, 280 
for mixtures, 293 
Percus-Yevick equation as, 279 
Pressure tensor, 381, 382, 408 
Primitive model, 326 
restricted, 328 
Principal axes, 477, 533 
Principal moments of inertia, 133 
Principle of equal a priori probabilities, 36, 
118 

Probability distributions, 20 
Products of inertia, 134 
Projection operator, 574, 576, 588 
Pure quantum state, 37 

Quantum corrections to B 2 {T\ 250 
Quantum mechanics, 9 
Quantum statistics, 73 
Quasi-particles, 69 

Radial distribution function, 259, 290 
and chemical potential, 264 
computer program for. Appendix D 
density derivative, 290 
density expansion, 262, 263, 272, 273 
first density correction, 273, 290 
hard-sphere, 282 
interionic, 353 
temperature derivative, 291 
and virial coefficients, 262, 263 
Raman scattering, 476, 486-488 
Random mixture theory, 293, 294 
Random phase approximation, 348 
Random variable, 20 
continuous, 20 
discrete, 20 
independent, 30 


Rate of shear tensor, 392 
Rayleigh- Jeans law, 192 
Rayleigh scattering, 63, 476, 564, 571 
Recurrence time, 416 
and irreversibility, 417 
Reduced distribution functions, 405 
Reduced mass, 27, 93, 1 10 
Ree-Hoover Pade approximant, 257, 280 
Reference system, 302 
Relative coordinates, 366 
Relativistic kinetic energy, 127 
Relaxation time, 419 
Renormalized potential, 350 
Resolvent operator, 574 
Restricted primitive model, 328 
numerical results, 346, 347, 349, 350 
Resummation of virial series, 342 
Retarded Green’s function, 623 
Reversal paradox, 415 
Reversion of a series, 163, 170, 190, 225 
Riemann zeta function, 33, 34, 171, 180, 
191 

Rigid rotor 
classical, 27, 128, 141 
quantum, 10, 94 
Rigid rotor-harmonic oscillator 
approximation, 91, 93, 111 
Root-mean-square speed, 126 
Root-mean-square velocity, 358 
Root point, 272 
Rotation matrices, 534, 535 
Rotation-vibration coupling constant, 112 
Rotational constant, 94, 110, 134 
for diatomic molecules, 95 
for polyatomic molecules, 1 32 
Rotational diffusion equation, 398 
Rotational Hamiltonian, 27, 128, 141 
Rotational kinetic energy 
asymmetric top and, 141 
of diatomic molecule, 27 
of linear molecule, 27 
symmetric top and, 141 
Rotational Laplacian, 398 
Rotational partition function 
asymmetric top and, 136 
of heteronuclear diatomic molecule, 98 
of homonuclear diatomic molecule, 104 
of linear molecule, 133 
spherical top and, 135 
symmetric top and, 135 
Rotational relaxation, 398 
Rotational spectrum, 94, 110, 111 
Rotational temperature, 99, 132 
Rotational wave functions, 102, 103 
Russell-Saunders coupling, 88 


Sackur-Tetrode equation, 86 
Scalar product, 612 



640 


INDEX 


Scattering function, 298, 299 
neutron, 545 

Schottky defect, 214, 219 
Schrodinger equation, 9 
Schrodinger representation, 472, 616, 617 
Schwartz inequality, 539, 576 
Screened coulombic potential, 334 
Second law of thermodynamics, 13, 44, 46 
Second moment condition, 355 
Second virial coefficient, 227, 228, 233-237 
angle-dependent potentials and, 249, 250 
determination of potential parameters 
from, 236, 237, 248 
experimental, 233, 235 
hard-sphere potential and, 234 
Lennard-Jones potential and, 235, 247, 248 
quantum corrections and, 241 , 242, 243, 
250, 315, 316 
reduced form of, 233, 236 
square well potential for, 235 
Sutherland potential for, 247 
triangle potential for, 247 
Selection rules 
for harmonic oscillator, 94 
for rigid rotor, 94 

Shear modulus (modulus of rigidity), 583 
Shear viscosity, 361, 385, 386, 431 
reduced, 531, 532, 539, 434, 448 
Sonine polynomials 

Sound, speed of,29, 127, 208,249, 390, 391, 572 
Space-time correlation function, 545 
distinct, 545 
self, 545, 564 
Spectral density, 556, 557 
Speed of light, 593 
Speed of sound, 373 
Spherical top, 134 
eigenvalues and degeneracy of, 134 
partition function for, 135 
Spin-orbit interaction, 87 
Spin waves, 219 
Spontaneous process, 45, 46 
Square-well fluid, 308 
Square-well parameters 
and second virial coefficient data, 433 
and viscosity data, 433 
Square-well potential, 235 
Standard free energy, 19, 154 
Standing wave, 178 
State vector, 613 
Stationarity, 496, 509, 539, 554 
Statistical weights, 103, 104, 111 
Stefan- Boltzmann constant, 181, 192 
Stefan-Boltzmann law, 180, 181, 192, 193 
Stirling’s approximation, 21, 32 
Stochastic differential equation, 453 
Stochastic process, 553 
Stokes operator, 381 
Stosszahlansatz (molecular chaos 
assumption), 410, 422, 425 


Stream velocity, 407 
Streaming operator, 409 
Stress tensor, 381, 382, 392, 396, 516 
Strongly degenerate ideal Bose-Einstein gas 
171 

Strongly degenerate ideal Fermi-Dirac gas, 
164 

Structure factor, 260, 291, 300, 301 
from CWA, 318 
molecular dynamics of, 301 
Substantial derivative, 381 
Summational invariant, 366 
Superposition approximation, 267, 280, 281 
290, 292 

Rice-Lekner improvement for, 283, 284 
test of, 280, 281 
time-dependent, 406 
variational principal of, 281 
Sutherland constant, 365 
Sutherland potential, 247 
Symmetric top, 134, 135 
eigenvalue and degeneracy of, 1 35 
partition function for, 135 
polarizability of, 533 
Symmetric wave function, 12 
Symmetry number, 101, 105, 133 
Symmetry of wave functions, 12, 28, 69, 
101, 186 


r 3 -law, 200, 204 

Teller-Redlich product rule, 148, 158 
Tensor, 382 

Thermal conductivity coefficient, 362, 447 
and time correlation function, 521 
reduced, 435, 448, 531, 532, 539 
Thermal diffusion, 429 
Thermal diffusion ratio, 429 
Thermal pressure, 443 
Thermal transport coefficients, 361, 364, 
428, 430, 447 

and collision integrals, 431 
and Chapman-Enskog method, 364 
and density expansion, 440 
and kinetic theory, 364 
reduced, 435, 448, 531, 532, 539 
and rigorous derivation, 579 
and time-correlation functions, 512, 524, 
525 

Thermodynamic formulas 

for canonical ensemble, 44, 45, 58 
for grand canonical ensemble, 54, 55, 58, 
65 

for isothermal-isobaric ensemble, 58, 65 
for microcanonical ensemble, 57, 58 
Thermodynamic functions 
of ideal diatomic gas, 108 
of ideal monatomic gas, 85, 86, 87 
of ideal polyatomic gas, 136, 137 
Thermodynamic functions, table of, 58 
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Thermodynamic functions and partition 
functions, summary of, 58 
Thermodynamic tables, 151-157 
Thermodynamics, review of, 13 
Third law of thermodynamics, 14, 30, 44, 
47, 54, 65, 138, 141 
Third virial coefficient, 237-239 
experimental, 238, 239 
hard-sphere potential and, 240 
Lennard-Jones potential and, 238 
reduced form of, 238 
square-well potential and, 249 
Time-correlation function, 467 
and Chapman-Enskog theory, 525 
and corresponding states, 531, 532, 539 
and Enskog theory, 529 
and integral differential equation, 575 
and mean free path theory, 523 
Time-dependent operator, 472 
Time-displacement operator, 405, 418 
Time-reversal paradox, 415 
Totally symmetric vibration, 487 
Trace of a matrix, 183 
Trajectories in phase space, 118, 123 
Transition probability, 471 
Translational partition function, 81, 82, 89, 
93, 95, 129 

Translational quantum numbers, 82, 89, 1 13 
Transverse waves, 179, 201, 620 
Traveling wave, 177, 178, 201, 621 
Triangle potential, 247 
Two-body problem, 26 
Two-component system, 90, 248 
Two-fluid theories, 294 
Two-state systems, 78, 79 


Uncertainty principle, 9 
Undetermined multipliers, method of, 23, 
40, 43, 53 
Unit tensor, 382 
Unitary operator, 537, 616, 617 
Unperturbed system, 302 
Ursell functions, 251 
Ursell-Mayer expansion, 251 


van der Waals constants, 304, 305, 321, 373 
van der Waals equation of state, 49, 304 
and critical parameters, 321 
generalized, 321 
reduced, 321 

and virial coefficients, 321 
van der Waals-1 theory, 295, 296, 297 
van der Waals-2 theory, 295, 296, 297 
van Hove space-time correlation function, 
545 

distinct, 545 
self, 545, 564 


Vapor pressure, of solids, 220 
Variance, 20 
Vector identities, 396 
Vector potential, 622 
Vector space, 613 
Velocity contours, 395 
Verlet-Weis algorithm, 323 
Vibrational partition function, 96, 130 
Vibrational spectrum, 94, 110, 111 
Vibrational spectrum of a crystal, 194 
Vibrational temperature, 97, 132 
Vibration-rotation spectrum, 104 
Vineyard approximation, 586 
Virial coefficients 
and coulombic potential, 327, 351 
from integral equations, 278, 279, 291 
and intermolecular potential, 226-233 
higher (fourth, fifth, . . .) 239, 240, 241, 
252, 257 

Lennard-Jones potential and, 236 
and mixtures, 341 
of Bose-Einstein gas, 170, 190 
of Fermi-Dirac gas, 163, 190 
quantum corrections and, 241-243 
and rigid spheres, 234, 239-241, 279 
square well potential and, 235 
Virial expansion, 163, 170, 223, 245 
of thermodynamic functions, 248 
and rectangular conduit, 394, 395 
Viscosity coefficient, 361, 385, 386, 431, 447 
determination of, 393 
and mixtures, 449 
reduced, 431, 448, 531, 532, 539 
Viscous flow, 392 
Volume expansion coefficient, 217 
Volume of an TV-dimensional sphere, 12, 28, 
124 


Wave equation, 390, 619, 628 
inhomogeneous, 623 
Wave function, 9 
Wave vector, 178, 201, 207, 628 
Weakly-degenerate ideal Bose-Einstein gas, 
169 

Weakly-degenerate ideal Fermi-Dirac gas, 
162 

White spectrum, 587 
Wiener-Khintchine theorem, 553 


X-ray scattering, 259-260, 300, 546 

Zermelo’s paradox, 416 
Zero moment condition (electroneutrality), 
355 

Zero of energy, 95, 96, 111, 152, 153 

Zero point pressure, 166 

Zeta function, 33, 34, 171, 180, 191 






